I save every /v/ webm thread i see, using DownthemAll to save things with their original filenames. I've discovered, to my surprise, that 8chan will serve you two versions of the same file if you click and drag over the entire page with DtA, one with metadata and one without, which is interesting. Really hoping that Hydrus dupe-finder rolls out to videos soon, so i can trim all this down and merge a few things together.
As of right now, 158,937 images and videos, coming up to 400GB in file size, but i guess that's my fault for deliberately going for large, oversized webms rather them tiny images. I've also started converting gifs that don't move into jpg and gifs that do and are also over 1mb in size to webm, which is a bit of a side project right now.
I commit all the tags with a space after the namespace. That's my fault, sorry. The Filename: tag is totally ruined thanks to my work. Due to a personal problem with unnamespaced tags, i've completely removed them all from my personal database. Every single tag in my database has a namespace to match. Right now, my list of usable namespaces is as follows:
Aircraft: (Which by itself, is parented to Board:/k/. I use it for anything that has an aeroplane in it, and if i don't recognize it, i use Aircraft: tagme.)
Art: (Which is full of my wallpapers, as well a lot of /tg/ character and landscape art. Has a lot of Tagme for some reason)
Board: (All of which are chan boards, like /k/, /pol/, or /v/. Used as both a place for board culture stuff as well as more generic potholes for things that i should post in those boards and only those boards)
Booru: (Most of which has been too much trouble to remove, so i've just kept it from the public tag repo)
Chans: (Like board, but for specific chans. Been thinking about changing it to Website: Because i've already got a chan:reddit knocking about)
Chapter: (To sort big things where page: just isn't enough. I use it for suits in a deck of tarot cards(Clubs, hearts, wands, etc) and for seasons in long running shows, like ATHF)
Characters: (Both fictional and nonfictional, merging the Person: tag on the repo into it. Weather they go into Character: or Creator: is on a case by case basis)
Country: (Which i just made because i'm sick and tired of not having something to tag /pol/ memes with)
Creator: (Same as Character: but entirely for the people who create things. I also like to use it to narrow down a Character: tag. For instance, Creator: Moonman refers exclusively to any music track with Moonman in it's name but Character: Moonman can also refer to things like memes and reaction images that bear his likeness.)
Emotion: (Which is a generic pothole for reaction images for whatever purpose.)
Filename: (Which is by far the biggest one on here, with 72000 files tagged with it in some way. Fairly easy to manage, though. I can always tell which ones are mine and which ones are automated or taken from the repo, because mine have a space in between the file and the namespace. And now i know, having said that, that somebody's going to start copying me and doing that for themselves, and it'll all be ruined.)
Firearm: (Just like Aircraft: but for funs)
Genre: (Which is parented to Medium: music. I don't use it all that often, and the ones i have could be all moved to memes without issue. Stuff like Vaporwave and Synthwave, they have a style beyond just the music.)