Ok going though some of the downloaders to see if the effects of dup filtering are showing in a major way yet and noticed this
Got this error from this image
https://i.4cdn.org/aco/1555128352927.png
'PngStream' object has no attribute 'chunk_eXIf'… (Copy note to see full error)
Traceback (most recent call last):
File "include\ClientImportFileSeeds.py", line 1246, in WorkOnURL
self.DownloadAndImportRawFile( file_url, file_import_options, network_job_factory, network_job_presentation_context_factory, status_hook )
File "include\ClientImportFileSeeds.py", line 571, in DownloadAndImportRawFile
self.Import( temp_path, file_import_options )
File "include\ClientImportFileSeeds.py", line 790, in Import
( status, hash, note ) = HG.client_controller.client_files_manager.ImportFile( file_import_job )
File "include\ClientCaches.py", line 1132, in ImportFile
file_import_job.GenerateInfo()
File "include\ClientImportFileSeeds.py", line 293, in GenerateInfo
self._thumbnail = HydrusFileHandling.GenerateThumbnailBytes( self._temp_path, bounding_dimensions, mime, percentage_in = percentage_in )
File "include\HydrusFileHandling.py", line 81, in GenerateThumbnailBytes
thumbnail_bytes = GenerateThumbnailBytesFromStaticImagePath( path, bounding_dimensions, mime )
File "include\ClientImageHandling.py", line 295, in GenerateThumbnailBytesFromStaticImagePathCV
numpy_image = GenerateNumpyImage( path, mime )
File "include\ClientImageHandling.py", line 57, in GenerateNumpyImage
numpy_image = GenerateNumPyImageFromPILImage( pil_image )
File "include\ClientImageHandling.py", line 131, in GenerateNumPyImageFromPILImage
s = pil_image.tobytes()
File "site-packages\PIL\Image.py", line 749, in tobytes
File "site-packages\PIL\ImageFile.py", line 252, in load
File "site-packages\PIL\PngImagePlugin.py", line 680, in load_end
File "site-packages\PIL\PngImagePlugin.py", line 140, in call
AttributeError: 'PngStream' object has no attribute 'chunk_eXIf'
and yes, dup detecting is showing some results which is great, beyond the few gigs I have gotten back.
However, I also see a potential issue. I have brought this up in the past,
https://boards.4chan.org/trash/thread/22573284
the cyoa threads, I honestly like these and think they are fun to go though. however when one comes up in dup detector, I tend to skip it, and seeing a few of the pages that have files removed due to dup processing, I made the right call.
Is it possible for a file removed due to dup processing to link back to the file that won? I mean if this is a no-low effort thing, this would be fantastic, it would effectively allow a thread watcher to present all the files with the best versions of said files shown if that's how you wanted to display them.
but honestly if this would be a high effort endeavor, I think you likely have better things to do.