GitHub - cnemri/awesome-gemini-omni: A curated list of awesome Google Gemini Omni prompt guides, interactive platforms, and creative showcases. · GitHub
/" data-turbo-transient="true" />
Skip to content
Search or jump to...
Search code, repositories, users, issues, pull requests...
-->
Search
Clear
Search syntax tips
Provide feedback
--><br>We read every piece of feedback, and take your input very seriously.
Include my email address so I can be contacted
Cancel
Submit feedback
Saved searches
Use saved searches to filter your results more quickly
-->
Name
Query
To see all available qualifiers, see our documentation.
Cancel
Create saved search
Sign in
/;ref_cta:Sign up;ref_loc:header logged out"}"<br>Sign up
Appearance settings
Resetting focus
You signed in with another tab or window. Reload to refresh your session.<br>You signed out in another tab or window. Reload to refresh your session.<br>You switched accounts on another tab or window. Reload to refresh your session.
Dismiss alert
{{ message }}
cnemri
awesome-gemini-omni
Public
Notifications<br>You must be signed in to change notification settings
Fork
Star
main
BranchesTags
Go to file
CodeOpen more actions menu
Folders and files<br>NameNameLast commit message<br>Last commit date<br>Latest commit
History<br>2 Commits<br>2 Commits
media
media
CONTRIBUTING.md
CONTRIBUTING.md
LICENSE
LICENSE
README.md
README.md
View all files
Repository files navigation
Awesome Gemini Omni
Gemini Omni is Google's next-generation, natively multimodal AI model capable of seamlessly processing and generating text, code, images, audio, and video. The Gemini Omni Flash model is also officially available to try directly in the Gemini App.
Contents
Official Resources
Interactive Platforms
Capabilities and Showcases
Tutorials and Courses
Official Resources
Official Product Page - Official overview of the Gemini Omni model architecture, native multimodality, and core features.
Prompt Guide - Official comprehensive guidelines by Google DeepMind for designing effective multimodal prompts.
Model Card - Official model card outlining technical specifications, training datasets, and safety mitigations for Gemini Omni Flash.
Veo Prompt Guide - Official guidelines by Google DeepMind for crafting high-fidelity video generation prompts in Veo.
Ultimate Prompting Guide for Veo 3.1 - In-depth prompt engineering and styling handbook from the Google Cloud blog for Veo 3.1.
Interactive Platforms
Google Flow - Creative canvas and workspace enabling interactive collaboration and native video editing powered by Gemini Omni.
Capabilities and Showcases
Native Video Editing
LEGO and Historical Film Transfer - Demonstration of transforming the famous 1896 train film into LEGO style and adding custom elements natively.
Claymation and Anime Style Transfer - Video style alteration example showing adjustment into anime or claymation while preserving spatial motion.
Dynamic Logo and Text Tracking - Showcase of placing high-fidelity text at precise timestamps and rendering logos onto fast-moving tennis balls in Google Flow.
Video-to-Video Style Alteration - Native video editing test demonstrating high-fidelity video style adjustments.
Material Synthesis and Modification - Native material transformation using combined text prompts and video inputs.
Multimodal Video Generation
Google Maps Route to First-Person View - Synthesis of a first-person driving video based on a static map screenshot with a drawn route.
High-Speed Camera Zoom and World Knowledge - High-speed camera panning, zoom, and refocus simulation demonstrating deep spatial world knowledge in Gemini Omni Flash.
Single-Line Video Generation - Streamlined generation using ultra-compact single-line prompts.
Multimodal Interaction
Visual Question Answering and Object Identification - Interactive identification and reasoning of dynamic real-world objects.
Tutorials and Courses
AI Agents for Image and Video Generation - Short course focused on building AI agents that automatically generate and refine media outputs.
Contributing
Contributions are always welcome! Please read the contribution guidelines first.
Footnotes
This repository is curated and maintained by Chouaieb Nemri.
Read more articles and insights by Chouaieb Nemri on Medium.
About
A curated list of awesome Google Gemini Omni prompt guides, interactive platforms, and creative showcases.
Topics
awesome
gemini
awesome-list
multimodal
generative-ai
google-flow
gemini-omni
Resources
Readme
License
CC0-1.0 license
Contributing
Contributing
Uh oh!
There was an error while loading. Please reload this page.
Activity
Stars
star
Watchers
watching
Forks
forks
Report repository
Releases
No releases published
Packages
Uh oh!
There was an error while loading. Please reload this page.
Contributors
Uh oh!
There was an error while loading. Please reload this page.
You can’t perform that action at this time.