So you can do text to image with the text to video model (and only that one), and it does make images. If you want to make images, set frames to 1, bypass the video combine, and enable the preview.
Just remember: if you want to do text to image, you must use a text2video Wan model, NOT the image2video one.
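
If you run the workflow from a script instead of clicking through the UI, the same three changes can be sketched on an API-format workflow export (a dict of node id to `class_type` and `inputs`). This is only a rough sketch with assumptions: the node class names `EmptyHunyuanLatentVideo` (with a `length` input for the frame count), `VHS_VideoCombine`, and `PreviewImage` are guesses at what your workflow uses, so check them against your own export. The helper name `to_single_image` is made up for illustration. Since bypassed nodes do not appear in an API export, "bypass the video combine and enable the preview" is approximated here by swapping the combine node for a preview node on the same images.

```python
import copy


def to_single_image(workflow,
                    latent_class="EmptyHunyuanLatentVideo",  # assumed class name
                    combine_class="VHS_VideoCombine",        # assumed class name
                    preview_class="PreviewImage"):           # assumed class name
    """Return a copy of an API-format workflow dict set up for one still image."""
    wf = copy.deepcopy(workflow)  # leave the caller's workflow untouched
    for node_id, node in list(wf.items()):
        if node["class_type"] == latent_class:
            # One frame means the video model produces a single image.
            node["inputs"]["length"] = 1
        elif node["class_type"] == combine_class:
            # Replace the video combine with a preview fed by the same images.
            images = node["inputs"]["images"]
            wf[node_id] = {"class_type": preview_class,
                           "inputs": {"images": images}}
    return wf


# Toy workflow for illustration only; a real export has many more nodes.
example = {
    "1": {"class_type": "EmptyHunyuanLatentVideo",
          "inputs": {"width": 832, "height": 480, "length": 33, "batch_size": 1}},
    "2": {"class_type": "VAEDecode", "inputs": {"samples": ["1", 0]}},
    "3": {"class_type": "VHS_VideoCombine",
          "inputs": {"images": ["2", 0], "frame_rate": 16}},
}
still = to_single_image(example)
```

This does not check which model is loaded, so the point above still applies: the checkpoint has to be a text2video Wan model.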