Derpy Whooves
Looking For My Doctor
I’m not on the dev team, but downloading a representative image set that’s already been tagged to train your algorithm, and to compare to human taggers, shouldn’t be any kind of problem. People are downloading gigabytes of images themselves by hand every day and people do regular grabs of new images on the site, so I don’t imagine that you wouldn’t be able to grab enough images for a good base set without any problems.
I would suggest, however, that you choose images that have high scores to train your algorithm; they have a better chance of having been tagged correctly, or having been reported if they were tagged incorrectly. If images haven’t been voted on by at least a couple hundred people there isn’t any guarantee that the image was ever tagged correctly to start with.
I would suggest, however, that you choose images that have high scores to train your algorithm; they have a better chance of having been tagged correctly, or having been reported if they were tagged incorrectly. If images haven’t been voted on by at least a couple hundred people there isn’t any guarantee that the image was ever tagged correctly to start with.