← Workbook
Dossier Print
sequential

Image to Motion: Animating Stills

Image to Motion

OpenAI's Sora demonstration in February 2024 triggered a wave of coverage that struggled to distinguish between "technically impressive" and "commercially ready." The gap between a 20-second demo clip and a usable production tool is large. The gap between the demo and the regulatory framework for the content it might produce is larger.

Medium 2024-03-15 scientific

OpenAI Sora Demo Reveals Persistent Limitations With Hands, Text, and Physics

Analysis of OpenAI's Sora demonstration clips from February 2024 reveals consistent failure modes. The model lacks implicit understanding of physics, with objects vanishing, deforming, or replicating unexpectedly. Hand articulation remains problematicβ€”characters exhibit anatomical anomalies such as additional appendages or merged limbs. Text rendering is unreliable. These failures match patterns documented in earlier video generation research, suggesting Sora does not fundamentally solve the technical challenges that have constrained the field for two years.

"The demo showed the best outputs. The physics still fail."

Source β†—
Times of Israel 2025-01-14 legal

Experts Debate Whether AI-Generated Holocaust Content Aids Memory or Distorts It

Educators, ethicists and Jewish leaders are divided on AI's role in Holocaust remembrance. While projects like Meta's 'Tell Me, Inge...' use AI holograms of survivors to preserve testimony for future generations, critics warn that AI-animated figures risk becoming ethically hollow, lacking the embodied presence central to authentic memory transmission. Deepfake animation of Holocaust figures evokes the uncanny valley effect; as technology improves, emotional responses could override rational perception, potentially amplifying historical falsehoods or enabling denial.

"The tool that preserves testimony can also fabricate it."

Source β†—
Variety 2024-12-20 economic

Animation Guild Ratifies Contract With Weak AI Protections Despite Director Opposition

The Animation Guild ratified a 2024-27 contract with studios on December 22, 2024, with 76.1% approval, despite concerns from prominent members including "The Mitchells vs. the Machines" director Mike Rianda. Rianda stated the AI terms were "far from what we were going for" and warned that "studios can replace workers with AI. Studios can force you to use AI." The contract grants workers the ability to "consult" with productions on AI tool use but includes no staffing minimums to prevent AI-driven job losses. Rianda's final warning: "A 'yes' vote means no AI protections for three years."

"Consultation rights mean studios heard you. Then did it anyway."

Source β†—
Korea Herald 2024-09-11 legal

South Korean Law Criminalizes Deepfake Creation as K-Pop Industry Suffers Record Surge

South Korea's deepfake epidemic escalated dramatically in 2023: deepfake pornography cases surged 4.6-fold to 21,019, with South Korean singers and actresses comprising 53 percent of victims. In response, the National Assembly passed legislation criminalizing creation and distribution, with penalties reaching five years imprisonment and 50 million won ($37,400) in fines for creators. Distributors face up to seven years. However, enforcement gaps remain: a 2020-2023 analysis showed only four of 71 deepfake crime convictions resulted in imprisonment, and no laws yet penalize possession or viewing.

"The law exists. Enforcement is another matter."

Source β†—

The big question

Sora can animate historical photographs into realistic video. Should there be legal restrictions on animating archival images of real people, and if so, where is the line?

passage

Brief

Converting Stills to Motion

Image-to-video AI works by learning patterns from thousands of video clips: how light changes, how objects move, how one frame transitions to the next. When you give it a still image, it extrapolates forward β€” inferring plausible motion based on what it has seen.

The mechanism. A neural network trained on video learns two things: spatial features (what is in the frame) and temporal features (how things change over time). When you feed it a single image, the model fills in the "next frames" by predicting likely motion vectors β€” the direction and speed objects should move, how shadows shift, how depth might unfold.

Why it matters for you. A still photograph is a moment. Animation is a sequence of moments. Image-to-video AI collapses that boundary β€” you can now take a photograph and extend it into motion without shooting video, without frame-by-frame drawing. A portrait can become a breathing face. A landscape can show clouds drifting. A product shot can show the object rotating.

Real limits. The AI is not omniscient. It guesses motion based on probability. A photograph of a person standing still might animate their clothes fluttering gently β€” plausible, but not necessarily what you intended. Objects moving out of frame disappear. The duration of output is fixed (typically 6–10 seconds). Motion feels smooth but not always physically accurate.

Your job. Pick a still image with enough visual information for the model to infer motion. Describe the motion you want β€” subtle or dramatic. Run the generation. Judge the result: does it feel right? Is the motion coherent? Does it match your intention? Refine and iterate.

title

WEEK 2: BUILDING YOUR FILM

Make a Photograph Breathe

A still is dead. The moment you make it move, it lives.

Learn the principles of believable motion β€” the Disney animation rules adapted for AI video generation. You will write prompts that tell an AI exactly how to animate your hero shots, turning static frames into flowing, living action. No keyframe guessing. No render loops. Just clarity.

1 / 7

Loading activity...

Answer key
**Correct order:** lock a reference β†’ write the motion prompt β†’ set first and last frame β†’ interpolate β†’ cut to length.

**The break: the reference lock (stage 1).** A mid-clip face-morph means the model had no stable identity anchor to hold across frames β€” interpolation faithfully blended two *different* faces. It is not an interpolation bug; interpolation did its job on bad inputs. Fix upstream: lock subject + seed before any motion. **A late-stage symptom usually has an early-stage cause.**

Loading activity...

Answer key
**Suggested answers:**

Still 1 (Person at desk):
- Plausible: Head turning slightly, hand moving the pen, gentle breathing/chest movement, eyes blinking
- Risk: The AI might animate the notebook or pen in ways that don't match the intended "writing" motion. It might assume the person is looking down more than they are. The motion might feel jerky if the original image is too sharp/posed.

Still 2 (Mountain landscape):
- Plausible: Cloud drifting across the sky, subtle lighting shift as time passes
- Risk: The AI might animate parts of the mountain (making rock faces shift slightly), or add motion to static foreground elements. Real clouds would take hours to drift significantly; 6–10 seconds will show minimal cloud movement.

Still 3 (Coffee mug):
- Plausible: Steam rising, subtle wisps, objects on the table settling or shifting slightly
- Risk: The AI might animate the mug itself (making it rock or tip) or the notebook pages (making them flutter without cause). The steam motion will be repetitive and loop, not realistic continuous rising.

**Grading:** Award full marks for identifying at least one plausible motion and one realistic pitfall per image. Partial credit if the motion identified is too generic ("things move") or the pitfall is vague.
Task

Convert

Choose a still image β€” photograph, illustration, or screenshot β€” that you think will animate well. Describe the image in detail: what's in the frame, where objects are positioned, what the lighting looks like. Then write a motion prompt: what motion do you want the AI to infer? Be specific β€” are you animating a person's expression, an object's rotation, environmental change (light, weather, particles), or combination? Run the image through an image-to-video tool (Runway Gen-3, Pika, or similar). Generate a 6–10 second motion clip. Review the output. In a brief reflection, answer: Did the motion match your intention? What did the AI understand correctly? What surprised you or felt wrong? Would you iterate β€” changing the prompt, the image, or both β€” and why?

Open Claude Output Β· project
image-to-motion Β· content dossier Β· teacher copy