by Mike Thomas

4 January 2025

Generative AI will create images for you based on text prompts. For instance, using ChatGPT and DALL-E, and the prompt “Create 16 by 9 photo realistic image of woman flying steampunk spaceship” resulted in the following image:

I have been using AI to create cute little graphics like for our Christmas Party. I used the prompt “generate a facebook header for the Arundel Camera Club Christmas Party featuring Santa holding a camera” to generate the following image:

It’s clear that AI Technology is here to stay and advancing quickly. I can certainly see why AI is threatening the livelihoods of artists and photographers. Advances in creating more realistic images and even videos may pose a threat to writers, models, and actors. In fact, the recent strike in Hollywood was partly in response to AI. Besides threatening livelihoods, AI is training on the work of artists and writers. Many see this as steeling.

Club Rules with Respect to AI

Ethics aside, our club rules do not allow club members to enter images into competitions that were created by AI.

Specifically, our contest rules state:

“It is unethical, constitutes plagiarism, and violates copyright law to simply copy an image (even in another medium) that was made by someone else. This includes the use of images generated by artificial intelligence (AI) that was trained using other artists’ work. Images determined to be copies of another’s artist’s work, or to have used another’s work, will be disqualified.”

AI Critique

Are there other ways to use AI to help with your photography? What if you could use AI to critique your contest entries before submitting them into contest. Could that help you improve your images? Maybe.

To test this, I submitted four images to ChatGPT and asked it to critique the four images.

ChatGPT Prompt: Describe and critique the following four photographs.

Could you provide descriptions of the photographs, or upload the images if you’re able? With descriptions, I can provide critiques and suggest ways to enhance their aesthetic or technical aspects.

 

I uploaded the four images to ChatGPT and here is the response.


 

Photo 1 (Image1 – Bird in Flight)

  • Strengths:

    • The vibrant colors of the bird (likely a Painted Bunting) are stunning and immediately draw the viewer’s attention.
    • Capturing a bird in mid-flight is technically challenging, and the image showcases great timing and skill.
    • The blurred wings add a sense of motion, making the photo dynamic.
  • Areas for Improvement:

    • The background, while natural, feels a bit busy and detracts from the subject. A wider aperture (smaller f-stop) could create more background blur (bokeh).
    • The focus appears slightly off, with more attention on the bird’s body rather than its head and eyes, which are critical focal points in wildlife photography.
    • Cropping or reframing might enhance the composition by placing the bird slightly off-center for a more balanced look.

 

Photo 2 (Image2 – Perched Bird)

  • Strengths:

    • Excellent sharpness and focus on the bird, especially around its eye, which creates a strong connection with the viewer.
    • The pose is natural, and the bird’s vivid colors contrast beautifully with the earthy tones of the background.
    • Good use of depth of field to isolate the subject.
  • Areas for Improvement:

    • The stick in the foreground, though part of the natural setting, might feel slightly cluttered. Adjusting the angle to minimize distractions could improve the composition.
    • Experimenting with lighting, such as softer early-morning or late-afternoon light, might reduce the slightly harsh tones and add more depth to the image.

 

Photo 3 (Image3 – Bird Clinging to a Branch)

  • Strengths:

    • A unique pose with the bird clinging to the branch, showcasing a different angle and behavior.
    • The color palette is rich, with the greens in the background complementing the bird’s red and blue hues.
  • Areas for Improvement:

    • Similar to the first photo, the background could be blurred further to make the bird stand out more.
    • The framing could be adjusted to include more space around the bird, creating a more balanced and less cramped composition.

 

Photo 4 (Image4 – Another Bird in Flight)

  • Strengths:

    • The bird is captured mid-flight with wings spread wide, demonstrating action and vitality.
    • The composition is balanced, with the bird’s head well-positioned in the frame.
    • The rich colors are again a highlight, maintaining visual interest.
  • Areas for Improvement:

    • The motion blur in the wings is appealing, but the bird’s face could use sharper focus to make the image more impactful.
    • The background, while natural, has some distracting elements. Post-processing to subtly reduce the contrast of background elements might help.

General Recommendations:

  1. Focus on the Eyes: Always prioritize the bird’s eye as the focal point; it adds life to the image.
  2. Background Blur: Use a wide aperture and/or increase the distance between the subject and background to create a smoother, less distracting background.
  3. Lighting: Shoot during the golden hours (early morning or late afternoon) for softer, more dramatic lighting.
  4. Post-Processing: Adjust contrast, saturation, and clarity to enhance the bird’s features while minimizing distractions in the background.

These are fantastic captures showcasing great skill and patience. Small tweaks could elevate them further!

 

ChatGPT Prompt: Compare and contrast the four images

Here’s a comparison and contrast of the four images, focusing on key aspects like composition, lighting, action, and background:


1. Subject Behavior

  • Image 1 & Image 4: Both showcase the bird in flight, capturing a dynamic moment. The motion of the wings in these images adds energy and movement.
  • Image 2 & Image 3: Both depict the bird at rest, with Image 2 showing the bird perched on a branch and Image 3 highlighting it clinging to a vertical twig. These moments feel more intimate and allow for finer detail in the bird’s posture.

2. Focus and Sharpness

  • Image 1 & Image 4: The sharpness is slightly compromised due to motion. The wings are blurred (which is expected and adds a sense of motion), but the face and eyes could be crisper for greater impact.
  • Image 2 & Image 3: Both are sharp, with excellent focus on the bird’s body and especially its eye. Image 2 is slightly more striking in clarity than Image 3.

3. Composition

  • Image 1 & Image 4: The compositions are action-oriented but feel slightly busy due to the dense background of branches. The framing, while balanced, could benefit from more negative space around the bird to emphasize its flight.
  • Image 2: The composition feels natural and balanced, with the bird centrally positioned and the perch adding context.
  • Image 3: The vertical orientation of the twig introduces a unique line for the viewer’s eye to follow, making this image more dynamic than Image 2.

4. Background and Environment

  • Image 1 & Image 4: The backgrounds in these images are busy with branches and foliage, which slightly distract from the bird in flight. A more blurred or simplified background would help isolate the subject.
  • Image 2 & Image 3: These have calmer, less intrusive backgrounds. Image 2, in particular, uses depth of field effectively to blur the background, enhancing the bird’s prominence.

5. Action vs. Stillness

  • Image 1 & Image 4: Represent action and energy, showcasing the bird mid-flight. They evoke excitement and capture the fleeting nature of bird photography.
  • Image 2 & Image 3: Offer a sense of stillness and intimacy, providing the viewer with a moment to appreciate the bird’s detailed plumage and vivid colors.

6. Color and Lighting

  • All four images beautifully capture the bird’s vibrant plumage, especially its deep blues, reds, and greens.
  • Image 2 stands out for its even lighting and rich tones, which bring out the bird’s colors most effectively.
  • Image 1 & Image 4 have slightly harsher lighting, creating less contrast between the bird and the background.

Overall Assessment

  • Best for Action: Image 4. Among the two flight shots, this one has better composition and framing of the bird.
  • Best for Detail and Intimacy: Image 2. Its sharpness, focus, and balance make it the most striking and polished image.
  • Most Unique Pose: Image 3. The bird’s clinging pose makes it stand out, though slightly more space around the subject would enhance the image.

Each photograph brings something unique to the table, demonstrating different facets of bird photography, from action and behavior to detail and composition.

 

ChatGPT Prompt: which image is best

The best image is Image 2 (the perched bird) due to the following reasons:

  1. Focus and Sharpness: The bird is tack-sharp, particularly around the eye, which is the most important focal point in wildlife photography.

  2. Composition: The bird is positioned well within the frame, with an uncluttered perch that complements the subject. The image feels balanced and natural.

  3. Background: The blurred background effectively isolates the bird, drawing attention to its vivid colors without distraction.

  4. Lighting: The lighting in this image is even, enhancing the bird’s vibrant plumage and creating a pleasing tonal range.

  5. Intimacy: The stillness of the bird allows the viewer to appreciate its details, such as the texture of the feathers and the richness of its colors.

While the action shots (Image 1 and Image 4) are exciting and technically impressive, and Image 3 has a unique pose, Image 2 stands out for its overall polish, aesthetic appeal, and technical execution.

 

ChatGPT Prompt: rank the four images

1st Place: Image 2 (The Perched Bird)

  • Why:
    • Exceptional focus, especially on the eye.
    • Well-balanced composition and use of depth of field.
    • Vibrant colors and effective isolation of the subject from the background.
    • Evokes a calm, intimate moment that allows the viewer to connect with the bird.

2nd Place: Image 3 (The Bird Clinging to the Branch)

  • Why:
    • Unique and dynamic pose, showing an interesting behavioral aspect of the bird.
    • Good focus and detail, though slightly less sharp than Image 2.
    • The background is slightly busier than in Image 2, but the bird’s vibrant colors still pop.

3rd Place: Image 4 (Bird in Flight – Close Frame)

  • Why:
    • Captures the bird mid-flight with a strong composition.
    • The motion blur in the wings adds dynamism, but the head and eye could be sharper.
    • The background is slightly distracting, which diminishes the overall impact.

4th Place: Image 1 (Bird in Flight – Straight-on View)

  • Why:
    • Captures a bold frontal view of the bird in flight, which is technically challenging and exciting.
    • However, the focus is less precise compared to the other images, and the busy background detracts from the subject.
    • While action-packed, it lacks the polish and clarity of the higher-ranked images.

Summary

  1. Image 2: Best overall for clarity, composition, and aesthetic.
  2. Image 3: Unique and intimate, but slightly less refined.
  3. Image 4: Dynamic and energetic, but could benefit from sharper focus.
  4. Image 1: Bold and impressive, but slightly less polished than the others.

Each image demonstrates excellent skill and highlights different aspects of bird photography, so all are commendable!

 

You can refine your results by providing a more specific prompt.

ChatGPT Prompt: Analyze each photo including focus and clarity, color, composition, lighting, background, subject presentation

I won’t give the results for each photo due to the size of the output. But here is ChatGPT’s response for Image 1.

Image 1 (Bird in Flight – Straight-on View)

  • Focus and Clarity:

    • The motion blur in the wings effectively conveys movement, but the focus on the bird’s head and eyes is slightly soft, which reduces its impact.
    • The legs and body are sharper, but the eyes, a critical focal point in wildlife photography, are not as defined.
  • Color:

    • The bird’s vibrant plumage (blue, red, and green) stands out beautifully, but the saturation could be slightly enhanced in post-processing to make it more vivid.
    • The colors of the background clash a bit with the subject, reducing the overall harmony.
  • Composition:

    • The frontal perspective creates a dramatic and engaging viewpoint.
    • However, the bird is centered and closely cropped, which feels a bit tight. Placing the bird off-center might create a more dynamic composition.
  • Lighting:

    • The lighting is natural but slightly harsh, leading to some uneven exposure on the bird’s feathers.
    • Softer lighting, such as during golden hour, would enhance detail and reduce the harshness.
  • Background:

    • The background is cluttered with branches and foliage, making it harder to isolate the subject.
    • A shallower depth of field could have blurred the background more effectively.
  • Subject Presentation:

    • The bird’s flight is captured at an exciting moment, but the lack of sharpness on the head and eyes limits its emotional connection with the viewer.

 

ChatGPT even provides a summary table and final thoughts.

 

Summary Table

Criteria Image 1 Image 2 Image 3 Image 4
Focus and Clarity Moderate Excellent Good Moderate
Color Vibrant, slightly busy Rich and natural Vibrant and balanced Vivid, slightly harsh
Composition Tight, centered Balanced, spacious Unique, slightly cramped Dynamic but tight
Lighting Harsh in places Soft and even Natural, balanced Uneven
Background Cluttered Clean and blurred Slightly busy Cluttered
Subject Presentation Dynamic but soft Intimate and sharp Dynamic and unique Powerful but soft

 

Final Thoughts

  • Best Overall: Image 2. It excels in all categories, with sharp focus, rich colors, clean composition, and great lighting.
  • Most Unique: Image 3. The clinging pose is dynamic and showcases interesting behavior.
  • Best for Action: Image 4. The motion is captivating, but focus and background could improve.
  • Most Dramatic: Image 1. The frontal flight pose is engaging but lacks the sharpness and clarity of the other images.

 

Conclusion

I was surprised by the results. I did like the critiques. It gave me something to consider for all the images. So yes, using AI to critique your images may be useful. Remember AI is just a tool, and you are the artist. If you love an image, enter it. Make it your own.

You may not agree with the AI final ranking but then again, we don’t always agree with our human judges. While many of our volunteer judges are truly insightful and provide valuable critiques during contests, AI could be better than some of our less skilled judges. In fact, I think using AI could help train better judges. However, I don’t think AI can replace the human touch, the understanding and inspiration of art. We had a judge who liked trains, and no surprise gave an award to a train photo. We had a recent judge who assigned a great deal of meaning to the American flag because it resonated with him. So, art isn’t always about ISO, Shutter Speed, and Aperture.

Tools

I used the paid version of ChatGTP with the 4o model. There is a free version. There are also other models to choose from. ChatGPT has several commercial competing Large Language Models including Google Bard, Microsoft Bing AI, Anthropic Claude, Perplexity AI, etc. Using different LLMs and models will result in different responses. I wouldn’t even expect to always get consistent responses from the same tools and models.

ChatGPT used DALL-E 3 to create the images. There are several competing AI image generators including Stable Diffusion, Midjourney, Adobe Firefly, etc.