The latest Gemini Nano with on-device ML Kit GenAI APIs

Posted by Caren Chang – Developer Relations Engineer, Joanna (Qiong) Huang – Software Engineer, and Chengji Yan – Software Engineer

The latest version of Gemini Nano, our most powerful multimodal on-device model, just launched on the Pixel 10 device series and is now accessible through the ML Kit GenAI APIs. Integrate capabilities such as summarization, proofreading, rewriting, and image description directly into your apps.

With GenAI APIs, we're focused on giving you access to the latest version of Gemini Nano while providing consistent quality across devices and model upgrades. Here's a sneak peek behind the scenes at some of the things we've done to achieve this.

Adapting GenAI APIs for the latest Gemini Nano

We want to make it as easy as possible for you to build AI-powered features using the most powerful models. To ensure GenAI APIs provide consistent quality across different model versions, we make many behind-the-scenes improvements, including rigorous evals and adapter training.

  1. Evaluation pipeline: For each supported language, we prepare an evaluation dataset. We then benchmark the evals through a combination of LLM-based raters, statistical metrics, and human raters.
  2. Adapter training: With results from the evaluation pipeline, we then determine whether we need to train feature-specific LoRA adapters to be deployed on top of the Gemini Nano base model. By shipping GenAI APIs with LoRA adapters, we ensure each API meets our quality bar regardless of the version of Gemini Nano running on a device.
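
The post does not describe how these adapters are built or applied, so the following is only a minimal sketch of the generic LoRA idea, not the GenAI APIs' actual implementation. It assumes the standard formulation in which a frozen base weight W is adjusted by a low-rank product B·A; the toy matrices, the function names, and the "summarization" adapter are hypothetical and exist purely for illustration.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(base_weight, lora_b, lora_a, scale=1.0):
    """Return base_weight + scale * (lora_b @ lora_a); base_weight is left unmodified."""
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]


# Toy 3x3 frozen base weight (identity), standing in for a shared base model layer.
base = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# Hypothetical rank-1 adapter for one feature: B is 3x1, A is 1x3.
summarization_b = [[1.0], [0.0], [2.0]]
summarization_a = [[0.5, 0.0, 1.0]]

# The feature sees the adapted weight; the base weight stays as it was.
adapted = apply_lora(base, summarization_b, summarization_a)
print(adapted)
```

The point of the sketch is that the adapter here stores only 6 numbers (3 + 3) rather than a second 3x3 matrix, and the base weight is never changed. Under that assumption, one base model can be shared while each feature carries its own small adjustment.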

The latest Gemini Nano performance

One area we're excited about is how this updated version of Gemini Nano pushes performance even higher, especially the prefix speed, which is how fast the model processes input.

For example, here are results from running text-to-text and image-to-text benchmarks on a Pixel 10 Pro, with a Pixel 9 Pro for comparison.

|  | Prefix Speed – Gemini nano-v2 on Pixel 9 Pro | Prefix Speed – Gemini nano-v2* on Pixel 10 Pro | Prefix Speed – Gemini nano-v3 on Pixel 10 Pro |
| Text-to-text | 510 tokens/second | 610 tokens/second | 940 tokens/second |
| Image-to-text | 510 tokens/second + 0.8 seconds for image encoding | 610 tokens/second + 0.7 seconds for image encoding | 940 tokens/second + 0.6 seconds for image encoding |

*Experimentation with Gemini nano-v2 on Pixel 10 Pro for benchmarking purposes. All Pixel 10 Pros launched with Gemini nano-v3.
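
To get a feel for what these figures mean in practice, here is a rough back-of-the-envelope sketch. It assumes input processing time is simply the prompt length divided by prefix speed, plus the fixed image-encoding time for image inputs. The 1,000-token prompt length is a hypothetical value chosen for illustration, and the estimate ignores output generation time and any other overhead.

```python
# Prefix speeds (tokens/second) and image-encoding times (seconds),
# copied from the benchmark figures above.
BENCHMARKS = {
    "nano-v2 on Pixel 9 Pro": (510, 0.8),
    "nano-v2 on Pixel 10 Pro": (610, 0.7),
    "nano-v3 on Pixel 10 Pro": (940, 0.6),
}


def prefill_seconds(prompt_tokens, tokens_per_second, image_encoding_seconds=0.0):
    """Estimate input processing time: tokens / speed, plus optional image encoding."""
    return prompt_tokens / tokens_per_second + image_encoding_seconds


PROMPT_TOKENS = 1000  # hypothetical prompt length, for illustration only

for name, (speed, image_encoding) in BENCHMARKS.items():
    text_only = prefill_seconds(PROMPT_TOKENS, speed)
    with_image = prefill_seconds(PROMPT_TOKENS, speed, image_encoding)
    print(f"{name}: text-to-text {text_only:.2f} s, image-to-text {with_image:.2f} s")
```

Under these assumptions, a 1,000-token text prompt takes about 1.96 seconds to process with nano-v2 on Pixel 9 Pro and about 1.06 seconds with nano-v3 on Pixel 10 Pro, which is roughly 1.8 times faster.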

The future of Gemini Nano with GenAI APIs

As we continue to improve the Gemini Nano model, the team is committed to using the same process to ensure consistent, high-quality results from GenAI APIs.

We hope this will significantly reduce the effort needed to integrate Gemini Nano into your Android apps while still allowing you to take full advantage of new versions and their improved capabilities.

Learn more about GenAI APIs

Start implementing GenAI APIs in your Android apps today with guidance from our official documentation and samples: GenAI API Catalog and ML Kit GenAI APIs quickstart samples.

