any way we can reduce the hash length for attached files w/ export code?

aaronkyle · November 8, 2022, 9:16am

Hi Team.

[not sure if this is site feedback or community help]

I am currently trying to upload the exported files from a notebook, which has attached files. The hashes used for the attached files are very long, and when I upload them to either MS Azure or AWS S3 (haven’t tried GitHub yet), the names keep getting truncated and, consequently, the files aren’t being referenced correctly when called from the index.html doc. Any way to get around this?

btw: here’s an example of what happens when one just tries to upload these files:

AWS:
https://s3.amazonaws.com/data.visualization/nepal-household-facilities-2011/index.html

Azure:
https://obsvrdemo.blob.core.windows.net/$web/index.html

(same problem on both platforms)

mootari · November 8, 2022, 4:00pm

Where do the names get truncated? That is, at/after which step?

aaronkyle · November 8, 2022, 4:18pm

Upon upload to AWS or Azure. And both truncate in the exact same way.

mootari · November 8, 2022, 4:56pm

Can you share an example of original name vs truncated name?

aaronkyle · November 8, 2022, 5:10pm

Absolutely

Original:
f5ab26c5b8b549826421a2d1975bb57797588267cf56229d9f12ea8ff07ad9334a3f94a5928fc8c4d85f442cb3dc4603b9affdad3460c1ce6d786cb05d9432b1.zip

Truncated:
https://s3.amazonaws.com/data.visualization/nepal-household-facilities-2011/files/F5AB26~1.ZIP

mootari · November 8, 2022, 6:10pm

I don’t think that’s either AWS’ or Azure’s fault. I suspect that your OS may have already shortened the name. Are you on Windows? Can you inspect the file’s properties and share a screenshot of that?

aaronkyle · November 8, 2022, 11:49pm

Sorry for the slow reply. The name is not truncated after untar in my file explorer. The change is only apparent after upload.

…But indeed you are correct that it has something to do with Windows. After switching to Linux and uploading, everything works well.

As for Windows: I am perplexed. When I look at the directory, I see full names. When I look at a file’s properties, I see the same truncating pattern in the file path (file name wasn’t shown in properties) … so what I see is not what I get.

I am very appreciative that this is solvable and very grateful that you showed me how to do so, but I am lost as to why this would look one way in the file Explorer but be interpreted entirely differently behind the scenes.

mootari · November 9, 2022, 5:20am

How do you upload those files in Windows? Do you have your Azure / S3 cloud storage mounted somehow and then drag them over?
Do shorter file names change as well (if they’re longer than 11 characters including extension)?
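For context, the truncated name above has the shape of a Windows 8.3 "short name" alias. Here is a minimal sketch of roughly how such an alias is formed. The function name `approximateShortName` is made up for illustration, and the logic is a simplification: it only models the first alias (`~1`) and ignores collision handling and character substitution.

```javascript
// Rough approximation of how Windows derives an 8.3 short-name alias.
// Assumption: only the "~1" case is modeled; real alias generation also
// handles collisions and replaces characters that are invalid in 8.3 names.
function approximateShortName(longName) {
  const dot = longName.lastIndexOf(".");
  const base = dot > 0 ? longName.slice(0, dot) : longName;
  const ext = dot > 0 ? longName.slice(dot + 1) : "";

  // Names that already fit into 8 + 3 characters need no alias.
  if (base.length <= 8 && ext.length <= 3) {
    return longName;
  }

  // First six characters of the base, upper-cased, followed by "~1".
  const shortBase = base.slice(0, 6).toUpperCase() + "~1";
  // At most three characters of the extension, upper-cased.
  const shortExt = ext.slice(0, 3).toUpperCase();
  return shortExt ? `${shortBase}.${shortExt}` : shortBase;
}

const original =
  "f5ab26c5b8b549826421a2d1975bb57797588267cf56229d9f12ea8ff07ad933" +
  "4a3f94a5928fc8c4d85f442cb3dc4603b9affdad3460c1ce6d786cb05d9432b1.zip";

console.log(approximateShortName(original)); // prints "F5AB26~1.ZIP"
```

That matches the truncated name in the S3 URL, which is why the OS looks like the more likely culprit than the upload target.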

aaronkyle · November 9, 2022, 5:23am

I use the web UI in both instances and drag and drop. I did the same on Linux.
No. The hashed js files that aren’t in the files folder of the notebook archive are not transformed.

mootari · November 9, 2022, 6:23am

I’m not sure I understand. Do you mean to say that you dragged the files directly from within the archive, without extracting it first?

aaronkyle · November 9, 2022, 6:27am

No, I extracted the archive, then dragged and dropped. The image above in Windows shows the extracted files. In the file explorer, these all read with their full file names. However, when I drop them into the AWS and Azure web UIs, they get truncated.

Performing this operation in the exact same way (except that I used tar xvf to unzip, rather than 7zip), I get the truncated names. It’s baffling.

mootari · November 9, 2022, 6:33am

I assume the same thing happens when you attach one of these files to a notebook?

Edit: You can also try the following. Add these two cells to a notebook:

viewof file = Inputs.file()

file?.name

Then choose one of those files.
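If you want the check to be explicit, a small helper can flag names that have the 8.3 alias shape. This is only a sketch: `looksLikeShortName` is a hypothetical helper, not part of Observable or `Inputs`, and the pattern is an approximation of what such aliases look like.

```javascript
// Hypothetical helper: returns true if a name has the shape of an 8.3 alias,
// i.e. up to six characters, a tilde and a number, then an optional
// extension of at most three characters.
function looksLikeShortName(name) {
  return /^[^.~]{1,6}~\d+(\.[^.]{1,3})?$/.test(name);
}
```

In a notebook, a cell such as `looksLikeShortName(file?.name ?? "")` would then show whether the browser received the alias instead of the full name.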

aaronkyle · November 9, 2022, 6:35am

I have not tried creating a notebook and attaching the files. These are downloaded from a notebook using ‘export code’. The hashes are Observable transformations of the filename… I assume using SHA or equivalent… and I assume to prevent storage of duplicate files.

Let me try…

EDIT:
Yes, when I drag and drop into the file attachments, the same truncating behavior occurs. The same happens when I use the input button, even though the full name is shown in the file selection interface when I select the file.

aaronkyle · November 9, 2022, 6:57am

Windows must have some strange ‘aliasing’ behavior that shows a different name compared to how it is “actually” stored.
