simplified version(the dataset has not changed)
dataset of 16 images of game screenshots
use trigger words