-
Notifications
You must be signed in to change notification settings - Fork 241
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Filesize zero error while scanning the git repo with talisman #374
Comments
Can you share the steps to reproduce the error? |
@venkatn087 : Are there size 0 files in the repository ? Can you set TALISMAN_DEBUG or execute talisman binary with |
@venkatn087 @svishwanath-tw I am also getting out of memory error when running the scan |
@nasreenwahab : Thanks for the update. I'm not convinced you and @venkatn087 are facing the same underlying issue. From what little I've worked on the codebase, I remember that files once read remain in memory. I have some follow up questions to plan next course of action: The scan feature was built in a haste and its repercussions are showing now. |
Additional complications to factor in. If the decision is to go with blocked reads of files, how to deal with secrets at the edges of a block ? This shouldn't (usually) be a problem with source code. Unless talisman is being used to scan individual files with more than a 100 thousand lines of code. (100K is an arbitrary number) How many files do should be maintain in memory ? One early request from promoters of talisman within thoughtworks was to display the actual secret in the output (which was later limited to upto 50 chars of output with a ... overflow indicator) . I believe this to be the reason to hold blobs in memory after the detector chain execution completes. A (comparatively) recent re-factoring I made prevented git sub-process proliferation by using a single sub-process to read contents using batch mode of The number of variables to consider are huge and I believe going forward without data is not going to yield good results. |
@jmatias : FYI |
Hi Team,
I am getting an Error message while scanning my github repo as mentioned below. Could you please let me know why as i getting an error?
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
ERRO[0011] error reading filesize: EOF
thanks
venkat
The text was updated successfully, but these errors were encountered: