Hacker News

Show HN: Nexa SDK – Build powerful and efficient AI apps on edge devices

Hacker News - Sun, 09/08/2024 - 2:03pm

Hey HN! Alex and Zack here from Nexa AI. We're excited to share something we've been working on.

Our journey began with the Octopus series --- action models for mobile AI agents [https://huggingface.co/NexaAIDev/Octopus-v2]. We focused on making sub-billion parameter models excel at function calling, making high accurate and fast function-calling possible on mobile and edge devices. But as we delved into developing full-fledged on-device applications, we hit a roadblock.

We realized that optimizing for function calling (tool-use) alone wasn't enough. Building powerful on-device AI apps requires a diverse set of tools: language models with domain expertise, speech processing, image generation, embedding models and more. That's when we decided to create Nexa SDK --- a comprehensive toolkit that brings together everything developers need to build powerful and efficient AI applications that run entirely on-device.

Here's what Nexa SDK offers:

- Support for both ONNX and GGML models. - An integrated conversion engine for making custom GGML Quantized Models for different device hardware requirements. - An inference engine that supports language models, image generation models, TTS, audio generation models, and Vision-Language Models. - An OpenAI-compatible API server with optimization in function calling. - A Streamlit UI for rapid prototyping. - An intuitive CLI for easy model management. - Backend optimizations for latency and power consumption on edge devices. We've designed Nexa SDK to be the go-to solution for developers pushing the boundaries of what's possible with on-device AI applications and AI on edge devices.

To showcase its capabilities, we've built several demo apps running entirely on your device:

- AI soulmate with uncensored model and audio-in/audio-out interaction. [https://github.com/NexaAI/nexa-sdk/tree/main/examples/ai_soulmate] - A quick interface for uploading and chatting with PDFs like your personal finance documents.[https://github.com/NexaAI/nexa-sdk/tree/main/examples/financial-advisor] - A meeting transcription app supporting multiple languages and real-time translation. [https://github.com/NexaAI/nexa-sdk/tree/main/examples/voice_transcription] We're proud to share that the winner of yesterday's (Sep 7) House AGI "AI PC/ GenAI Goes Local" hackathon used Nexa SDK to build a local semantic image search. [GitHub - asl3/deja-view: Semantic Image Search]

But we're just getting started! There are lots of exciting developments in our pipeline, and we can't wait to share them with you soon!

Check it out: https://github.com/NexaAI/nexa-sdk Docs: https://docs.nexaai.com/

If you're excited about the future of on-device AI, we'd really appreciate your support. A star on our GitHub repo goes a long way in helping us reach more developers!

Cheers, Alex & Zack

Comments URL: https://news.ycombinator.com/item?id=41481949

Points: 1

# Comments: 0

Categories: Hacker News

The Story Behind Fenster

Hacker News - Sun, 09/08/2024 - 1:53pm

Article URL: https://zserge.com/posts/fenster/

Comments URL: https://news.ycombinator.com/item?id=41481882

Points: 1

# Comments: 0

Categories: Hacker News

Tailscale SSH

Hacker News - Sun, 09/08/2024 - 1:52pm
Categories: Hacker News

Ask HN: Is anyone making use of the HTTP QUERY verb?

Hacker News - Sun, 09/08/2024 - 1:44pm

Comments URL: https://news.ycombinator.com/item?id=41481834

Points: 4

# Comments: 0

Categories: Hacker News

Pages