Why fine-tune an image model?

If you’re using Midjourney, FLUX, or Dalle to generate images, you may struggle to create consistent photos of your actual products. That’s because these models are general-purpose*,* and don’t know anything about your brand.

EverArt makes it extremely easy to train AI on product or lifestyle photography. All you have to do is upload a few images of your product or visual style and wait 10-20 minutes for the training to finish. From there, you can submit simple prompts to generate stunning visual content for immediate use in marketing, advertising and editorial*.*

Even simple prompts will do.


Creating your first model

  1. Tap “New Model.”
  2. Select the type of model to train (Aesthetic, Product, or Face).
  3. Upload your images into EverArt. Be extremely selective about what you upload (tips below)

Untitled


Model types:

Product Models:

Choose this option when you want to train on a consistent physical or digital product. The thing that’s consistent in your image uploads (for training) should be the product, everything else can variate.

Frame 10.png

Frame 14.png

(example dataset of a model trained on a can of pepsi)

CleanShot 2024-09-24 at 21.05.25@2x.png

Aesthetic Models:

Choose this option if there’s a photography style or mood board that you want to capture the aesthetic of. The thing that should be consistent in the data you upload for training is the visual “vibe” of your uploads (e.g. same lighting, coloring, or visual style). If you upload images that have too different of an aesthetic, it won’t work very well. The objects/subject can vary between the images.

Face Model:

Train a “face” model if you want to generate images of a specific person. Just upload 6-15 images of them, making sure the person you want to train on is the only person in each image. Try incorporating a mix of closer photos to reveal facial details and medium-distance photos to capture physique, don’t upload images where the person is too far away.