/hydrus/ - Parsing scripts

/hydrus/ - Hydrus Network

Archive for bug reports, feature requests, and other discussion for the hydrus network.

Mode: Reply

Name
Options
Subject
Message	Max message length: 12000
files	Drag files here to upload or click here to select them 0.00 / 50.00 MB Max file size: 32.00 MB Total max file size: 50.00 MB Max files: 5 Supported file types: GIF, JPG, PNG, WebM, OGG, and more

E-mail
Password	(used to delete files and posts)
Misc

Remember to follow the Rules

The backup domains are located at 8chan.se and 8chan.cc. TOR access can be found here, or you can access the TOR portal from the clearnet at Redchannit 3.0.

Board Locked? Request Reopening

APNG and GIF uploads are temporarily disabled while we deal with a spammer problem.

8chan.moe is a hobby project with no affiliation whatsoever to the administration of any other "8chan" site, past or present.

Parsing scripts Anonymous 11/14/2016 (Mon) 18:14:13 Id: f047d8 No. 4475

How about a thread for discussing/creating/sharing parsing scripts? I made one for md5 lookup on e621.net (actually I just modified Hydrus_dev's danbooru script). Let me know if I did anything wrong with it, I'm pretty clueless… but it seems to work fine.


[32, "e621 md5", 1, ["http://e621.net/post/show", 0, 1, 1, "md5", {}, [[30, 1, ["we got sent back to main gallery page -- title test", 8, [27, 1, [[["head", {}, 0], ["title", {}, 0]], null]], [true, true, "Image List"]]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-general"}, null], ["a", {}, 1]], null]], ""]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-copyright"}, null], ["a", {}, 1]], null]], "series"]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-artist"}, null], ["a", {}, 1]], null]], "creator"]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-character"}, null], ["a", {}, 1]], null]], "character"]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-species"}, null], ["a", {}, 1]], null]], "species"]], [30, 1, ["we got sent back to main gallery page -- page links exist", 8, [27, 1, [[["div", {}, null]], "class"]], [true, true, "pagination"]]]]]]

Anonymous 11/14/2016 (Mon) 18:37:52 Id: f047d8 No. 4476

Oops, looks like I spoke too soon. My e621 script only works if the file actually exists on the site, if it doesn't it appears that the e621 API sends an error code via the HTTP status code and that makes Hydrus think the script failed, resulting in an error message pop-up. I don't think you can set Hydrus to ignore error messages at the moment so my script is useless. Anyone know how to fix?

Anonymous 11/15/2016 (Tue) 20:17:50 Id: f047d8 No. 4484

Here's a script for md5 lookup on rule34.xxx


[32, "rule34.xxx md5", 1, ["http://rule34.xxx/index.php", 0, 1, 1, "md5", {"s": "list", "page": "post"}, [[30, 1, ["pagination found", 8, [27, 1, [[["body", {}, 0], ["div", {"id": "content"}, 0], ["div", {}, 0]], "id"]], [true, true, "post-list"]]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-general"}, null], ["a", {}, 0]], null]], ""]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-copyright"}, null], ["a", {}, 0]], null]], "series"]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-artist"}, null], ["a", {}, 0]], null]], "creator"]], [30, 1, ["", 0, [27, 1, [[["li", {"class": "tag-type-character"}, null], ["a", {}, 0]], null]], "character"]]]]]

hydrus_dev Board Owner 11/15/2016 (Tue) 23:31:43 Id: f047d8 No. 4488

>>4475 >>4476 I'll check this out tomorrow and see if I can sensibly catch the error code!

hydrus_dev Board Owner 11/16/2016 (Wed) 17:39:11 Id: f047d8 No. 4492

I've set it to catch and mention 404s without a fuss and more loudly report and record other network errors at the script or link node levels. Let me know if it doesn't work for you!

Anonymous 11/20/2016 (Sun) 17:11:50 Id: f047d8 No. 4526

(2.50 KB 512x84 dbmd5.png)

>>4492 Thank you for this update, my e621 script works well now. However, I have a Danbooru script which is still throwing network errors when the md5 isn't found on Danbooru. I am not sure how to check which error code it is returning or if there is something else wrong so I'll include it here and perhaps you can check.

hydrus_dev Board Owner 11/22/2016 (Tue) 18:37:29 Id: f047d8 No. 4539

>>4526 Thank you for this new example. Danbooru is giving 500 (Server Error) when it fails the md5 lookup. I suspect this is an unintentional generic 'something went wrong' error on their side, as 404 seems more appropriate, but I guess the json->html forward makes it more complicated. I will create a job to extend scripts to allow interpreting certain http status errors as standard veto conditions.

Anonymous 11/22/2016 (Tue) 23:23:31 Id: f047d8 No. 4545

>>4539 That sounds good, thank you. Some other sites also have the same error, not sure if they also return 500, so the ability to catch specific http errors as veto conditions would be nice.

Anonymous 11/25/2016 (Fri) 18:59:00 Id: f047d8 No. 4594

(2.23 KB 512x84 e621 md5.png)

(2.73 KB 512x84 gelbooru md5.png)

(2.71 KB 512x84 iqdb danbooru.png)

(2.69 KB 512x84 iqdb gelbooru.png)

(2.53 KB 512x84 rule.xxx md5.png)

Ignore the ones in >>4475 and >>4484 Here are some updated scripts, they all grab the rating tag as well as the normal tags.

Anonymous 11/25/2016 (Fri) 23:39:49 Id: f047d8 No. 4595

>>4594 God bless! Any chance you can make one that works with sankaku?

Anonymous 11/26/2016 (Sat) 10:43:20 Id: f047d8 No. 4598

>>4595 Unfortunately Sankaku doesn't work right now because, unless I'm reading their API documentation wrong, it is not possible to do a straight forward search for an md5. You'd have to do something like this: http://chan.sankakucomplex.com/post/index?tags=md5:eea8d884f3127c7a4024c531e4c1f23e I don't think the current parser system is able to generate an URL like that. Perhaps Hydrus_dev can look into this?

Anonymous 11/26/2016 (Sat) 13:44:39 Id: f047d8 No. 4600

I just noticed there's a problem with the e621 script (possibly others) where the part that extracts the rating isn't always correct on some images due to the fact that the rating isn't in the usual location. The formula to extract the rating that I'm currently using is this: 1st div tag with id = stats 1st ul tag 3rd li tag 1st span tag The problem lies in the "3rd li tag" part, as the rating is not always the third thing to be displayed under the "Statistics" header, it depends on whether "source" is displayed or not, etc. The tag itself looks like this: <li>Rating: <span class='redtext '>Explicit</span></li> This could be solved if there was a way to check for the "Rating:" part within the <li> tag. I don't see a way to solve this currently as the only thing you can do is to check for keys within the <li> tag itself. Hydrus_dev, please add :)

Anonymous 11/26/2016 (Sat) 15:56:17 Id: f047d8 No. 4603

>>4598 I've been fiddling with different configurations in the script but I didn't get anywhere. I wrote the script in a way that's the same as that url but Hydrus keeps adding the equal sign after the colon. Maybe if someone can write the script for the visually similar page that sankaku offers, I don't know, maybe that will work. https://chan.sankakucomplex.com/post/similar?

Anonymous 11/27/2016 (Sun) 13:26:39 Id: f047d8 No. 4612

(2.60 KB 512x84 iqdb sankaku.png)

>>4603 Yeah, the problem is Hydrus wants to use the tag=value format for everything but here it needs to be tag:value instead. Meanwhile, here's a iqdb script for sankaku, similar to what you suggested.

Anonymous 11/27/2016 (Sun) 14:48:58 Id: f047d8 No. 4613

>>4612 Thank you for your contribution. This was way over my head.

Anonymous 11/28/2016 (Mon) 09:30:07 Id: f047d8 No. 4615

>>4600 >here's a script >no, no there isnt >but it has replies, wtf >Services > Parsing Scripts > Import >import from image what the fuck is this program

Anonymous 11/28/2016 (Mon) 20:55:10 Id: f047d8 No. 4617

>>4615 The best image manager in existence.

Anonymous 11/28/2016 (Mon) 21:45:56 Id: f047d8 No. 4618

(2.53 KB 512x84 sankaku md5.png)

>>4598 >>4595 Here's a working sankaku script.

Anonymous 11/29/2016 (Tue) 12:20:17 Id: f047d8 No. 4619

>>4618 Thank you! "/post/show" isn't listed in their API documentation for some reason…

hydrus_dev Board Owner 11/30/2016 (Wed) 01:34:07 Id: f047d8 No. 4623

>>4598 What a pain! I'll make a note of that and figure out a solution. Even if another way was found in >>4618 , I'm sure something like this will come up again. An ugly fix to this, btw, I think would be to set the 'file identifier type' as 'custom input' and then paste the md5:abcd… in manually for each request. This would obviously be a pain, but it would work well for a one-shot. >>4600 I think I just replied to you in the main release thread. Thank you for this report, I will add a string contents test to the tag search. >>4615 >>4617 :^)

Anonymous 12/06/2016 (Tue) 01:10:41 Id: f047d8 No. 4679

>>4615 Never played Artificial Academy or 3D Custom Girl? You can do amazing things with computers.

Anonymous 01/08/2017 (Sun) 15:57:28 Id: f047d8 No. 4887

is there no imgur parsing script?

Anonymous 01/08/2017 (Sun) 21:05:11 Id: f047d8 No. 4894

>>4887 For what? This thread is for parsing tags from booru, you are thinking of scripts for the downloader engine which isn't available yet, as the dev hasn't started working on it.

Anonymous 02/04/2017 (Sat) 06:40:29 Id: f047d8 No. 5091

>>4894 Is there are some way to automatic use these scripts to try tagging all imagesin DB?

Anonymous 02/04/2017 (Sat) 08:00:37 Id: f047d8 No. 5092

>>5091 Not at the moment. But you can use HTAs in this thread to import a bunch of tag mappings from various boorus: https://8ch.net/hydrus/res/2651.html

Anonymous 02/05/2017 (Sun) 20:21:48 Id: f047d8 No. 5096

>>5092 >Why there are no tag archive from sankaku?

Anonymous 02/07/2017 (Tue) 03:49:03 Id: f047d8 No. 5099

Not strictly related, but to anyone looking for porn on Yahoo's new somehow even shittier soft-censored Tumblr, Boodigo has specifically a Tumblr search (also blogspot and clips4sale) This is rather better than my former plan to exhaustively search NSFW tumblrs which was "find a new blog that hasn't been flagged yet, open a billion tabs in Vivaldi manually for reblog/source blogs that look related from the username or that pop up often, and then go through and import them". The backup backup plan was "wait for Hydrus to have custom login and download engines and write an addon that downloads all reblog lists, searches through lots and lots of link lines, and rough-maps the interconnectivity of blogs to each other and the original entry point(s)". I still might try to write a connectivity mapping script for the hydrus db for all tags under a given namespace just for general usefulness, but for now I'm glad there's actually a way to search NSFW tumblrs.

Anonymous 02/07/2017 (Tue) 04:12:47 Id: f047d8 No. 5100

I'm hoping for a future parsing script update when the parser can download files to preview a tag lookup file to compare to your file.

Anonymous 02/07/2017 (Tue) 12:50:22 Id: f047d8 No. 5102

>>5096 I think it's because sankaku has some kind of limit on how many searches you can do so it would take months to rip all the tags from the site.

Anonymous 03/16/2017 (Thu) 11:20:22 Id: f047d8 No. 5346

Bump, hoping Hydrus_dev will get back to this soon as currently there are several boorus that just won't work with Hydrus' current system. Some expect an url like this for example https://booru.com/post/index?tags=md5:36bd7e49bb64b91b731d3d6e2b3a807a Can't do it with the current system! To be honest, the best and most flexible way to do with would probably be to allow you to enter an URL and put tags in it that hydrus then will replace with the relevant information. Something like this: https://booru.com/post/index?tags=md5:<md5> Hydrus would then take that and replace <md5> with the actual md5 of the current image to generate the finished URL.

Anonymous 03/18/2017 (Sat) 06:19:44 Id: f047d8 No. 5355

>>5346 Pretty sure the custom download engine will be able to do all of that and more once it's made, he had a lot of feature ideas for it.

Anonymous 09/24/2017 (Sun) 13:07:04 Id: a19e6c No. 6832

How do I use one of these scripts?

Anonymous 09/25/2017 (Mon) 02:23:19 Id: ad789a No. 6837

>>6832 tamper or greasemonkey

List of sites using md5 (before the dev patch) Anonymous 09/25/2017 (Mon) 03:09:28 Id: 0ca1a9 No. 6838

https://github.com/CuddleBear92/Hydrus-Presets-and-Scripts/issues/12

Anonymous 10/16/2017 (Mon) 07:59:17 Id: ad789a No. 6994

Anyone know of a script to rip the :orig files from twitter media posts?

Anonymous 10/22/2017 (Sun) 00:15:50 Id: 34e5b9 No. 7046

>>6994 Well I found a script, works in tampermonkey even though its for greasemoney. It only redirects to the :orig files but at least it saves a step. https://greasyfork.org/en/scripts/9510-twitter-image-orig-promoter/code

Anonymous 11/28/2017 (Tue) 00:12:19 Id: df34a8 No. 7354

Here's the booru tag parser script for grease/tampermonkey with derpibooru included since somebody asked in another thread https://pastebin.com/XtpZAp5D

Anonymous 12/02/2017 (Sat) 19:55:31 Id: f452e5 No. 7394

>>7354 Hm, looks like my previous post didnt work,well Anyways, i updated the script to work with greasemonkey 4.0 (and broke the copy sound in the process) Its only meant as a temporary amateur fix till the author updates the original https://github.com/leonpfeil/boorutagparser/blob/master/boorutagparser.user.js

Anonymous 01/22/2018 (Mon) 04:20:14 Id: 338490 No. 7790

Anyone have any updates on the parsing scripts for danbooru, sankaku and yandere? (I'm using those in cuddlebear92 page) The one for danbooru doesn't copy the medium category (like official art, high filesize etc), and none of them copy the rating in the right way, it's either nothing, or it gets copied as "rating:rating:" On another note, i saw the md5 for sankaku isn't listed anymore in the sharing page, i though it's because it doesn't work anymore, but i tried it and i still get tags normally

hydrus_dev Board Owner 01/23/2018 (Tue) 19:08:57 Id: 3887ee No. 7807

>>7790 The whole parsing system will be updated in the coming month or so, with the existing file lookup scripts automatically converted along with it. I expect the ability to parse and test all this stuff to improve breddy soon. I imagine the existing share spaces will grow with more and better parsers as well.

Anonymous 01/24/2018 (Wed) 23:28:31 Id: f3728d No. 7822

Booru tag parsing script isn't grabbing the full rez image from Danbooru These are all variations of the same image and they parsed correctly http://danbooru.donmai.us/posts/2813183 http://danbooru.donmai.us/posts/2824474 https://gelbooru.com/index.php?page=post&s=view&id=3820627 https://gelbooru.com/index.php?page=post&s=view&id=3820897 https://gelbooru.com/index.php?page=post&s=view&id=3836020 https://yande.re/post/show/405601 This did not parse correctly; it somehow downloaded a sample size of it. It's worth noting that Hydrus itself is unable to parse and download it, but the parsing script at least gets the sample rez http://danbooru.donmai.us/posts/2812948

Anonymous 06/27/2018 (Wed) 01:09:16 Id: bf8458 No. 9278

Is there are way to scrape files and tags from Zerochan?

Anonymous 06/27/2018 (Wed) 05:06:19 Id: b461bd No. 9280

>>4475 Hitomi, Tsumino, Hentai2Read , HentaiCafe, NHentai, HBrowse and Goddess are what /a/ recommends when avoiding SadPanda

Anonymous 12/27/2018 (Thu) 12:02:50 Id: aed163 No. 11124

(5.80 KB 512x125 e621 pool lookup.png)

Here's an e621 pool lookup. Seems to work for me, images appear in correct order in the browser pane. I just need to find a better way of tagging page:* and title:* atm I drag the files onto Krename which outputs to /tmp/hydrus/<title>/<page>.<ext> and use the tag based on file name import option.

Anonymous 02/12/2019 (Tue) 06:45:32 Id: 915732 No. 11590

(4.69 KB 512x111 a.png)

Just made a realbooru one. Fucking parsers are a pain in the ass.

Anonymous 02/14/2019 (Thu) 11:17:14 Id: 915732 No. 11610

(4.98 KB 512x111 easy-import downloader png - 1 downloaders.png)

>>11590 Wait, I fucked up. Here's the fixed version.

Anonymous 02/14/2019 (Thu) 17:56:27 Id: f5ba1f No. 11616

>>4475 Can any custom parsers handle logins? Like the twitter gallery situation is still out of the picture and has been for a few months now. Fur Affinity and InkBunny if parsers are made but without logins will barely scrape any content as well. I know Hdev said FA gallery parser is coming but without login support it's hardly worth the work to make one imo.

Anonymous 02/24/2019 (Sun) 04:21:53 Id: aed163 No. 11696

>>11616 You can make your own login scripts but IMO it's not worth it, especially when the site makes heavy use of javascript or captchas. Instead, just copy the cookies from your browser session to get logged in. >network>data>review session cookies Inkbunny needs "PHPSESSID" For other sites just copy anything that has any login looking things like username, base64 or hex string values until it works.

Anonymous 02/24/2019 (Sun) 18:01:48 Id: bf8458 No. 11698

What do I need to learn about HTML or JSON so I can make downloaders?

Anonymous 03/08/2019 (Fri) 07:28:47 Id: a643ac No. 11814

I'm trying to use the iqdb-tagger python script, but there is a PermissionError when it tries to write to windows temp folder. Anyone know how to fix? I tried setting the iqdb-tagger-server.exe, iqdb-tagger.exe and python.exe to run as administrator but it doesn't help. I'm on Windows 10. https://github.com/rachmadaniHaryono/iqdb_tagger

Anonymous 03/16/2019 (Sat) 16:15:28 Id: 915732 No. 11886

>>11698 See >>11673

Anonymous 05/30/2019 (Thu) 16:07:22 Id: 7b2523 No. 12763

>>7394 I've been using the tag parser and server (https://github.com/JetBoom/boorutagparser) fine until recently: random place he decided to host the sound went down, breaking a lot of shit. Thought I'd leave a note for anyone having problems: Just right-click on the script to edit it, then comment out (//) anything to do with the sound or variable it's stored in. That should get it working again.

Anonymous 06/03/2019 (Mon) 23:09:26 Id: bf8458 No. 12848

>>12763 I only use the parser, and just deleted the link to the audio file itself. Everything still works in the parser even with it there, but you get that stupid login prompt. And here I thought the boorus got hit with some new malware or something

Anonymous 06/04/2019 (Tue) 01:46:25 Id: bf8458 No. 12849

What's the deal with it not working on derpibooru anymore?

Anonymous 06/22/2019 (Sat) 00:32:46 Id: fb1a41 No. 12997

I installed and tried using iqdb_tagger but it complains that the 'hydra-python-core' distribution was not found and is required by hydrus. What gives?

Anonymous 06/27/2019 (Thu) 17:53:59 Id: 419f98 No. 13042

Has Pixiv parsing stopped working for anyone else recently?

Anonymous 06/27/2019 (Thu) 19:50:44 Id: bf8458 No. 13044

>>13042 What do you mean? There was no parser for pixiv. If you mean those extensions that let you direct load the images then those have broken for 1 year+ since pixiv keeps editing its sites

Anonymous 06/27/2019 (Thu) 20:43:06 Id: 419f98 No. 13045

>>13044 I might have found a custom set from CuddleBear92's GitHub repo (I sure as fuck didnt write them) but I had been reliably importing pixiv urls just days ago and now they error out; can't find anything. I havent looked into it too hard yet but was wondering if I'm alone

Anonymous 06/28/2019 (Fri) 01:03:37 Id: bf8458 No. 13048

>>13045 I think it's just you; (I'm using the Hydrus default pixiv parser) I made my 32 artist subs check now and they went through with no errors. But they had already checked recently so there weren't any files to snag. No idea why it would work yesterday but not today, unless that was made before they revised their site and they just happened to leave the old code running as a fallback till now.

Anonymous 06/28/2019 (Fri) 05:11:51 Id: 303955 No. 13052

The built in script for using iqdb to look up tags from danbooru works for me. There are many more like it on cuddlebear92's website, but they are 2 years old and don't seem to work at all anymore. I just want something that works the same as the built in function for other sites like sancom, gelbooru, etc., but it seems I'm left high and dry. I don't understand why it doesn't work anymore either. I went through the logic of the iqdb gelbooru script, for instance, and compared it with the HTML actually sent back by the website, the logic still seems sound.

Anonymous 06/28/2019 (Fri) 05:53:02 Id: 59b96b No. 13053

>>13042 Not just you, same happened to me on both the default hydrus parsers and the custom pixiv all-in-one set. Everything gets ignored and has been for a couple of days. All the pages come in as if I'm not logged in for whatever reason

Anonymous 06/28/2019 (Fri) 06:19:15 Id: 303955 No. 13054

>>13052 Hmm, I've found that the gelbooru one actually works off and on. Sometimes it oddly just returns a list with 4 crosses, instead of a list of actual tags though. Now then, what I'd really like to do is automate running file look up scripts on more than one file and automatically apply all tags to each file. There doesn't seem to be away to do this through the interface when more than one file is selected, but there has to be a way, right?

Anonymous 06/29/2019 (Sat) 11:14:36 Id: 7fead0 No. 13056

Hit the same pixiv issue just now. The login itself doesn't seem to be the issue, I reset and redid the login within Hydrus but that seems to have changed nothing.

Anonymous 06/29/2019 (Sat) 13:04:35 Id: 12d78e No. 13057

(4.18 KB 512x93 pixiv file page api parser.png)

Pixiv changed their API so the parser had to be redone. You can replace the old one with this one or wait until Wednesday as it should be in the next release. Also pixiv added captcha to login so you have to import cookies manually now. The login in hydrus won't work.

Anonymous 07/07/2019 (Sun) 15:43:02 Id: 566fd9 No. 13138

(3.58 KB 512x111 newsankakuparser.png)

The sankaku parser someone posted on this board that was supposed to remove the 2000 files limit didn't work properly for me, due to the naive way the parser fetched the next gallery page data I think, so I made a fix some while ago that works on my machine (TM). Please let me know if it works on yours, too.

Anonymous 07/11/2019 (Thu) 02:44:40 Id: 303955 No. 13162

>>13138 Working a treat right now. I understand a bit of html, but these parsers make no sense to me. Maybe I'll sit down and spend time to figure out how to do this myself sometime.

Anonymous 01/09/2020 (Thu) 06:35:50 Id: 69fa49 No. 13525

(3.63 KB 512x113 8kun downloader.png)

I'm not sure if this has been fixed yet, but I modified the default 8ch parsers to allow hydrus to download 8kun threads with filenames.

Anonymous 02/07/2020 (Fri) 20:43:22 Id: 085c83 No. 13616

The JSON API for boards like gelbooru returns all the tags, as well as the path to the files, hash, source, updated time, etc. Example https://gelbooru.com/index.php?page=dapi&json=1&s=post&q=index&limit=50&tags=cat%20rating:safe&pid=2 (The tags are HTML-escaped, but I don't know about other entries) So why do the gallery downloaders scrape HTML for each page instead of using all the information obtained from a search request? If I do a search for a set of tags, the downloader has to download the HTML for every single post's page just to check for duplicates and tags. It's a lot of wasted resources/effort for both client and server. If I already have all 50 files that turn up in the linked search, in total I did 1 request instead of 51 to verify that. Similarly, if I had to download all the images, in total it was 51 requests instead of 101, with the bonus that no HTML scraping had to be performed.

Anonymous 02/07/2020 (Fri) 23:56:30 Id: 085c83 No. 13620

I noticed gelbooru's JSON API returns tags as a single string with each tag delimited by spaces. Is there a way to split a JSON string match into multiple entries?

Anonymous 03/09/2020 (Mon) 11:59:01 Id: 2f0bc3 No. 13762

So I'm a newcomer to making downloaders, I made a bunch of url classes, such as for an HTML page of an album that contains many images, it redirects to an API call, which also has it's own class, I made parsers for the API response, selected which API query element corresponds to the next page (such as offset) and even added a next page URL in the parser. But no matter what I do, when I drag & drop an album's URL into Hydrus, it only downloads the first page worth of images and never goes further. Is it supposed to work like that? Do I have make something like GUG to make the continuous downloading work?

Anonymous 04/14/2020 (Tue) 23:25:40 Id: 1be05b No. 13953

(2.71 KB 512x112 e621_updated_parser.png)

Friendly neighborhood anon here - e621 seems to have added 'lore' and 'meta' tagtypes which the default parser can't catch - this updated parser can catch them.

Anonymous 04/23/2020 (Thu) 23:28:57 Id: dfaa46 No. 14001

I previously used a modified version of saucenao's generic script to automatically(-ish) reverse image search untagged images that show up, but now that e621 has their own reverse search, I whipped up my own python script. e621's reverse search also doesn't have a cap on searches done in 30s/24hr (it does require an account tho). https://gist.github.com/corposim/b7ccb6a2c8814032ddd65db91b371dc2

Anonymous 04/26/2020 (Sun) 02:23:43 Id: 655b38 No. 14013

(3.17 KB 512x113 babeswp_docl.png)

wasted my time on this

Anonymous 05/18/2020 (Mon) 17:08:07 Id: a51669 No. 14320

This might have been asked before, but is there a downloader for NicoSeiga? If not, does anybody know other tools for that?

Anonymous 05/21/2020 (Thu) 12:52:46 Id: 0efa77 No. 14328

I'm trying to get Hydrus to download from smugloli.net. I have made url classes that match the URL and created an API URL for the json, but when I try to watch a thread it instantly says "DEAD" with the log message saying there was no parser. It should work if the "4chan-style API parser" is used, but I have no clue how to make it use that.

Anonymous 05/30/2020 (Sat) 03:53:24 Id: 939c32 No. 14372

Anyone know what the situation is with gfycat redirecting NSFW content to some sort of sister site? I guess they intend for you to browse their new site "redgifs" but following old nsfw gfycat links takes me to "gifdeliverynetwork" Anyway in short I got some sort of gfycat/redgifs downloader bundle from cuddlebear's hydrus scripts git repo but I'm not really sure what to do with them and I can't download videos straight from redgifs like I used to with gfy, anyone else in a similar spot?

Anonymous 06/06/2020 (Sat) 00:45:27 Id: 7c774d No. 14403

(2.90 KB 512x112 pillowfort_post_parser_2020_06_05.png)

With the number of artists attempting to migrate to pillowfort from twitter, I tried my hand at building something to parse pillowfort posts. It could probably still use some cleanup and correction, but figured it was worth putting out there since I've gotten it to work for me pretty well so far.

realbooru Anonymous 06/13/2020 (Sat) 17:23:48 Id: 655b38 No. 14437

(3.63 KB 512x112 realbooru.png)

here's an updated realbooru downloader; includes a gug, post and gallery urls and a parser tags work well.

Anonymous 06/18/2020 (Thu) 08:54:12 Id: 0891ac No. 14466

Can the nijie parser download video and manga? It doesn't look like it from what I saw, but I may have missed a step. While I'm asking, How would I automatically fetch the nijie work:# ?

Anonymous 06/19/2020 (Fri) 22:35:07 Id: 7d3571 No. 14471

(3.85 KB 512x108 agnph_all_in_one.png)

Friendly neighborhood anon here. Someone once asked for an agn.ph downloader. This is an all-in-one that should work for the site.

Anonymous 06/20/2020 (Sat) 04:35:32 Id: 5a9c6e No. 14474

is there a parser for the FA Onion Archive?

Anonymous 07/17/2020 (Fri) 02:49:44 Id: b4a834 No. 14558

anything for rule34hentai?

Anonymous 08/06/2020 (Thu) 05:45:01 Id: 963f0c No. 14624

(14.14 KB 512x133 docl_instagram.png)

^wrong one, here is the one that works, tagging kindaaa works but location tags are busted (instagram)

Anonymous 09/08/2020 (Tue) 23:19:30 Id: 7907b3 No. 14710

>>14437 I think this has changed again, I'll give it a look but I am not good at it at all.

Anonymous 09/16/2020 (Wed) 06:01:36 Id: 3fc0bd No. 14739

>>14710 Anything on this anon?

Anonymous 09/18/2020 (Fri) 11:11:34 Id: 6c7e0e No. 14748

realbooru parser that functions at least

Anonymous 09/30/2020 (Wed) 18:12:29 Id: 8c2d2c No. 14782

Sankaku is now hiding lolis. Is there some way to get around this?

Anonymous 10/21/2020 (Wed) 18:09:27 Id: 4920bd No. 14837

I'm not sure if GUGs can make these, but anyone have a module for setting up Youtube subscriptions?

Anonymous 10/24/2020 (Sat) 06:29:33 Id: a382d2 No. 14847

>>14782 They're not hiding lolis. I don't understand why I keep hearing this. Did you check the mature content option in settings and clear your account blacklist? Do you have an account in the first place?

Anonymous 01/17/2021 (Sun) 03:06:16 Id: f5d248 No. 15140

Can someone help me understand what parsing scripts are for, and how to use them? Are they to improve the amount of tags that are found for images? Like a reverse search?

Index Catalog Archive Top Reply

Manage Board Moderate Board Moderate Thread

Forms

Delete

Password Unlink (Removes file reference from posts) Delete (Removes file from the server)

Report

Reason Category Global

No Cookies?

Quick Reply


Sage Bypass Check