Extract and display original document filenames #452

rmol · 2019-06-27T22:17:33Z

Description

Files submitted to the SecureDrop server are gzipped with their
original name, then encrypted with GPG. When downloaded, we can
extract the original filename from the gzip header, so they can be
opened with an appropriate application.
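
A minimal sketch of reading the stored name out of a gzip header, following the field layout in RFC 1952. The function name `read_gzip_original_filename` is hypothetical and does not come from this PR; the client's actual implementation may differ.

```python
import struct

FEXTRA = 0x04  # header flag: optional "extra" field is present
FNAME = 0x08   # header flag: original filename is present


def read_gzip_original_filename(fileobj) -> str:
    """Return the original filename stored in a gzip header, or "" if none."""
    # Fixed 10-byte header: magic (2), method, flags, mtime (4), xfl, os.
    header = fileobj.read(10)
    if len(header) < 10 or header[:2] != b"\x1f\x8b":
        raise ValueError("not a gzip stream")
    flags = header[3]

    # The optional extra field comes before the filename, so skip it.
    if flags & FEXTRA:
        (xlen,) = struct.unpack("<H", fileobj.read(2))
        fileobj.read(xlen)

    if not flags & FNAME:
        return ""

    # The filename is a NUL-terminated ISO 8859-1 (Latin-1) string.
    name = bytearray()
    while True:
        byte = fileobj.read(1)
        if not byte or byte == b"\x00":
            break
        name += byte
    return name.decode("latin-1")
```

The function only reads the header, so it can run on the decrypted stream before the payload is decompressed.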

This change adds the db.File.original_filename column, and populates
it when downloading and decrypting files. The file is still stored as
it has been, under the db.File.filename value stripped of extensions,
but when opened, that file is first hard linked to the original
filename.
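
The hard-link step can be sketched roughly as follows. `link_to_original_name` is a hypothetical helper, and reducing the name with `os.path.basename` is an assumption of this sketch rather than something the PR states.

```python
import os


def link_to_original_name(stored_path: str, original_filename: str) -> str:
    """Hard link stored_path under original_filename in the same directory."""
    # Assumption: drop any directory components from the extracted name so
    # the link always lands next to the stored file.
    safe_name = os.path.basename(original_filename)
    if not safe_name:
        raise ValueError("original filename is empty")

    link_path = os.path.join(os.path.dirname(stored_path), safe_name)
    if os.path.exists(link_path):
        # Reuse a link left over from a previous open of the same file.
        if os.path.samefile(stored_path, link_path):
            return link_path
        raise FileExistsError(link_path)

    os.link(stored_path, link_path)
    return link_path
```

A hard link gives the file a second name without copying its contents, so the application opening it sees the original extension.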

Since we can't wait for the file to be opened (starting the disposable
VM is pretty slow), we can't immediately remove that link, so
storage.delete_single_submission_or_reply_on_disk now cleans it up if
the object being deleted has original_filename populated.
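
A rough sketch of that cleanup, assuming the stored file and its link live in the same data directory. `delete_file_on_disk` and its parameters are hypothetical; this is not the body of `storage.delete_single_submission_or_reply_on_disk`.

```python
import os
from typing import Optional


def delete_file_on_disk(
    data_dir: str, stored_name: str, original_filename: Optional[str] = None
) -> None:
    """Remove the stored file and, if one was recorded, its hard link."""
    candidates = [stored_name]
    if original_filename:
        # Only present once the file has been downloaded and decrypted.
        candidates.append(os.path.basename(original_filename))

    for name in candidates:
        try:
            os.remove(os.path.join(data_dir, name))
        except FileNotFoundError:
            # The file may never have been opened, so the link may not exist.
            pass
```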

Fixes #163.

Test Plan

Check out this branch and run the client. Upload a file with a .txt extension via the source interface. Refresh the client, click on the source, then click on the Download item, and confirm that the displayed name changes to the name of the file as you uploaded it.

Then check that clicking on the file again opens it up in a suitable application in the disposable VM.

For bonus points, delete the source, or at least the submission, and confirm that upon refresh the data directory is properly cleaned up -- there should be no files with names matching the File's filename or original_filename.

Checklist

If these changes modify code paths involving cryptography, the opening of files in VMs, network (via the RPC service) traffic, or fine tuning of the graphical user interface, Qubes testing is required. Please check as applicable:

I have tested these changes in Qubes
I do not have a Qubes OS workstation (the reviewer will need to test these changes in Qubes)

redshiftzero

two quick notes

Makefile

securedrop_client/gui/widgets.py

securedrop_client/api_jobs/downloads.py

sssoleileraaa · 2019-07-01T18:30:03Z

securedrop_client/api_jobs/downloads.py

@@ -166,7 +168,7 @@ def call_download_api(self, api: API, db_object: Reply) -> Tuple[str, str]:
        sdk_object.source_uuid = db_object.source.uuid
        return api.download_reply(sdk_object)

-    def call_decrypt(self, filepath: str, session: Session = None) -> None:
+    def call_decrypt(self, filepath: str, session: Session = None) -> str:


It would be helpful to update the comment for this method in the DownloadJob base class to explain what kind of string it should return, since calling decrypt returns both the original filename and the filepath. The comment should make clear that we want the decrypted original filename so we can store it in the local db.

Agreed. Whatever we decide to return from decrypt_submission_or_reply should be explained here.
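
One way the requested documentation could read, as a sketch only. `DownloadJobSketch` is a hypothetical stand-in for the base class, and the documented return value assumes the outcome suggested in this thread (the original filename), which was still under discussion.

```python
from abc import ABC, abstractmethod
from typing import Any, Optional


class DownloadJobSketch(ABC):
    """Hypothetical stand-in for the DownloadJob base class."""

    @abstractmethod
    def call_decrypt(self, filepath: str, session: Optional[Any] = None) -> str:
        """
        Decrypt the downloaded file at `filepath`.

        Returns the original filename recovered during decryption, so the
        caller can store it in the local database. Subclasses with no
        original filename to report can return an empty string
        (an assumption of this sketch).
        """
        raise NotImplementedError
```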

securedrop_client/crypto.py

sssoleileraaa

lgtm with a few nits

securedrop_client/crypto.py

sssoleileraaa · 2019-07-01T21:38:55Z

Works as advertised, but one last question: Is there a reason to not show file extension? It seems like it would be more useful to show it so the journalist knows what they're downloading. An image, pdf, etc.

sssoleileraaa · 2019-07-01T21:43:07Z

ah, I know you were unable to access zeplin before, but it looks like the prototypes show file extension (see below)

https://app.zeplin.io/project/5c807ea562f734bd2756b243/screen/5cd3b87518e9f734ae0218bb

alembic/versions/bafdcae12f97_.py

rmol requested review from sssoleileraaa, heartsucker and redshiftzero as code owners June 27, 2019 22:17

redshiftzero reviewed Jun 27, 2019

Makefile (outdated, resolved)

securedrop_client/gui/widgets.py (outdated, resolved)

Address review; fix mypy error

5526501

sssoleileraaa reviewed Jul 1, 2019

securedrop_client/crypto.py (outdated, resolved)

sssoleileraaa approved these changes Jul 1, 2019

sssoleileraaa reviewed Jul 1, 2019

securedrop_client/crypto.py (outdated, resolved)

Incorporate suggestions from review

6c9f35b

sssoleileraaa reviewed Jul 1, 2019

alembic/versions/bafdcae12f97_.py (outdated, resolved)

Add down migration

29d20a6

sssoleileraaa merged commit 642a8b2 into freedomofpress:master Jul 2, 2019
