I don’t think anyone knows for sure which files those are. It would’ve been helpful if the NYT had published the file names. But maybe the NYT isn’t sure either, since they wrote that some of the images are “possibly” of teenagers.
To be on the safe side, I guess you could just remove all nude images from the dataset. It is a ton of images to go through though, hundreds of thousands.