In all seriousness, the next best dataset if this takes off might be e621. I agree with /u/gwern - I think this really could be a very useful dataset for more advanced image understanding techniques, and I frankly don't see how we could otherwise get a better dataset with similarly rich metadata for any feasible amount of effort. I really hope this doesn't end up getting ignored as "not serious" or "not respectable" as a dataset.
![anime images database anime images database](https://d2vlcm61l7u1fs.cloudfront.net/media%2F2f2%2F2f27a66a-9b40-427a-8edf-07e4fbb9936a%2FphpYZkKmr.png)
And Danbooru is constantly expanding and can be easily updated by anyone anywhere, allowing for regular releases of improved annotations. It is unlikely that the performance ceiling will be reached anytime soon, and advanced techniques such as attention will likely be required to get anywhere near the ceiling. While the Danbooru community does focus heavily on female anime characters, they are placed in a wide variety of circumstances with numerous surrounding tagged objects or actions, and the sheer size implies that many more miscellaneous images will be included. In contrast, the Danbooru dataset is larger than ImageNet as a whole and larger than the current largest multi-description dataset, MS COCO, with far richer metadata than the "subject verb object" sentence summary that is dominant in MS COCO or the birds dataset (sentences which could be adequately summarized in perhaps 5 tags). Metacademy is a great resource which compiles lesson plans on popular machine learning topics.įor Beginner questions please try /r/LearnMachineLearning, /r/MLQuestions or įor career related questions, visit /r/cscareerquestions/
![anime images database anime images database](https://pbs.twimg.com/media/Cc4X6XZUsAEwlQl.png)
Please have a look at our FAQ and Link-Collection Rules For Posts + Research + Discussion + Project + News on Twitter Chat with us on Slack Beginners: