Show HN: Gemini Omni – A curated list of native multimodal guides and showcases

GitHub - cnemri/awesome-gemini-omni: A curated list of awesome Google Gemini Omni prompt guides, interactive platforms, and creative showcases. · GitHub

/" data-turbo-transient="true" />

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Clear

Search syntax tips

Provide feedback

--> We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

/;ref_cta:Sign up;ref_loc:header logged out"}" Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

cnemri

awesome-gemini-omni

Public

Notifications You must be signed in to change notification settings

Fork

Star

main

BranchesTags

Go to file

CodeOpen more actions menu

Folders and files NameNameLast commit message Last commit date Latest commit

History 2 Commits 2 Commits

media

CONTRIBUTING.md

LICENSE

README.md

View all files

Repository files navigation

Awesome Gemini Omni

Gemini Omni is Google's next-generation, natively multimodal AI model capable of seamlessly processing and generating text, code, images, audio, and video. The Gemini Omni Flash model is also officially available to try directly in the Gemini App.

Contents

Official Resources

Interactive Platforms

Capabilities and Showcases

Tutorials and Courses

Official Resources

Official Product Page - Official overview of the Gemini Omni model architecture, native multimodality, and core features.

Prompt Guide - Official comprehensive guidelines by Google DeepMind for designing effective multimodal prompts.

Model Card - Official model card outlining technical specifications, training datasets, and safety mitigations for Gemini Omni Flash.

Veo Prompt Guide - Official guidelines by Google DeepMind for crafting high-fidelity video generation prompts in Veo.

Ultimate Prompting Guide for Veo 3.1 - In-depth prompt engineering and styling handbook from the Google Cloud blog for Veo 3.1.

Interactive Platforms

Google Flow - Creative canvas and workspace enabling interactive collaboration and native video editing powered by Gemini Omni.

Capabilities and Showcases

Native Video Editing

LEGO and Historical Film Transfer - Demonstration of transforming the famous 1896 train film into LEGO style and adding custom elements natively.

Claymation and Anime Style Transfer - Video style alteration example showing adjustment into anime or claymation while preserving spatial motion.

Dynamic Logo and Text Tracking - Showcase of placing high-fidelity text at precise timestamps and rendering logos onto fast-moving tennis balls in Google Flow.

Video-to-Video Style Alteration - Native video editing test demonstrating high-fidelity video style adjustments.

Material Synthesis and Modification - Native material transformation using combined text prompts and video inputs.

Multimodal Video Generation

Google Maps Route to First-Person View - Synthesis of a first-person driving video based on a static map screenshot with a drawn route.

High-Speed Camera Zoom and World Knowledge - High-speed camera panning, zoom, and refocus simulation demonstrating deep spatial world knowledge in Gemini Omni Flash.

Single-Line Video Generation - Streamlined generation using ultra-compact single-line prompts.

Multimodal Interaction

Visual Question Answering and Object Identification - Interactive identification and reasoning of dynamic real-world objects.

Tutorials and Courses

AI Agents for Image and Video Generation - Short course focused on building AI agents that automatically generate and refine media outputs.

Contributing

Contributions are always welcome! Please read the contribution guidelines first.

Footnotes

This repository is curated and maintained by Chouaieb Nemri.

Read more articles and insights by Chouaieb Nemri on Medium.

About

A curated list of awesome Google Gemini Omni prompt guides, interactive platforms, and creative showcases.

Topics

awesome

gemini

awesome-list

multimodal

generative-ai

google-flow

gemini-omni

Resources

Readme

License

CC0-1.0 license

Contributing

Uh oh!

There was an error while loading. Please reload this page.

Activity

Stars

star

Watchers

watching

Forks

forks

Report repository

Releases

No releases published

Packages

Uh oh!

There was an error while loading. Please reload this page.

Contributors

Uh oh!

There was an error while loading. Please reload this page.

You can’t perform that action at this time.

Show HN: Gemini Omni – A curated list of native multimodal guides and showcases

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

SpaceX not the behemoth everyone thought

The Mirror Is Part of the Machine

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits