# AI-Atlas [<img alt="GitHub" src="https://img.shields.io/badge/dynamic/json?logo=github&label=GitHub+Followers&labelColor=282c34&color=181717&query=%24.data.totalSubs&url=https%3A%2F%2Fapi.spencerwoo.com%2Fsubstats%2F%3Fsource%3Dgithub%26queryKey%3DAI-Atlas&longCache=true"/>](https://github.com/AI-Atlas) <img src="https://img.shields.io/github/stars/AI-Atlas?label=Stars" alt="stars">
- 🤖 Assistance & Chat
- 🧑🏫 Learning
- 🎯 Productivity
- 🎧 Text to Speech
- 🎙️ Audio Generation
- 🪄 Design & Creativity
- 🪛 Engineering
- 👾 Code & Development
- 📄 Resume & Career
- 📊 Data & Monitoring
- 👔 Business & Marketing
- 📱 Social Media Tools
- 💸 Crypto
- 🔍 OCR
- 🧮 Financial & Accounting
- 🔬 Research & Academic
- 🩺 Health & Wellbeing
- 🗜️ Hardware
- 🧑🏫 Learning
- 🔮 Open-source & Self-hosted
- 📦 Models & LLMs
- 📒 Libraries
- 🛟 Hardware & Cloud
- 🎮 Game & Entertainment
- 🚙 Automations & Agents
- ⚙️ AI Tools
- 🧰 Others
- 📋 Directories & Communities
List of Tools
🤖 Assistance & Chat
-
ChatGPT
ChatGPT is a sibling model to InstructGPT, which is trained to follow an instruction in a prompt and provide a detailed response. -
Bard
Bard is a conversational generative artificial intelligence chatbot developed by Google. -
Claude AI
Claude is a next-generation AI assistant based on Anthropic’s research into training helpful, honest, and harmless AI systems. -
Llama
LLaMA (Large Language Model Meta AI), a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI. -
Microsoft Copilot
Microsoft Copilot for Microsoft 365 is a sophisticated processing and orchestration engine that provides AI-powered productivity capabilities. -
Perplexity
Perplexity AI is a powerful research tool that uses artificial intelligence technology to gather information from multiple sources on the web and provide accurate responses to user questions in natural language. It is designed to be user-friendly, allowing smooth navigation and quick access to relevant information. -
Chatbot UI
The open-source AI chat app for everyone. -
Gemini
Gemini is built from the ground up for multimodality — reasoning seamlessly across text, images, video, audio, and code. -
Tock
Tock (The Open Conversation Kit) is a complete and open platform to build conversational agents - also known as bots. -
NVIDIA ChatRTX
ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, or other data. -
LobeHub
LobeChat Unlock the superpower of your brain, pioneering the new age of thinking and creating. Built for you, the Super Individual. -
Claude
Claude is a next generation AI assistant built for work and trained to be safe, accurate, and secure. -
Google PaLM2
PaLM 2 is our next generation large language model that builds on Google’s legacy of breakthrough research in machine learning and responsible AI. -
Quickchat AI
Design, tweak, and deploy your own AI Assistant to automate customer support, lead generation, and much more. -
InstructLab A new community-based approach to build truly open-source LLMs. Github page
-
Verbi
A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, Elevanlabs, CartesiaAI, and Deepgram APIs, plus local models via Ollama. Ideal for research and development in voice technology. -
You.com
Leverage a personal AI search agent & customized recommendations with You.com's AI chatbot. Converse naturally and discover the power of AI.
🧑🏫 Learning
-
Speak
Talk out loud, get instant feedback, and become fluent with the world’s most advanced AI language tutor.
🎯 Productivity
-
PopAi
A powerful AI tool that boosts productivity!Besides instant answers, explore search engine integration, PDF reading, Powerpoint generation and more! -
Magical AI
Auto-draft messages in 1-click, anywhere you have conversations. No annoying AI-training required. -
Writer
Transform work with full-stack generative AI, Accelerate growth across every team with the secure enterprise generative AI platform. -
Broadcast
Your own copilot. Capture meeting notes, track decisions, and automate followups. Enabling you to lead without busywork. -
Dart
Project management powered by AI, Level up your team's productivity with Dart's AI sidekick. -
Modelize.ai
Modelize.ai offers "Credits" that you can use to run tasks, chat with agents, and do more exciting things on the platform. -
botpress
The first next-generation chatbot builder powered by OpenAI. Build ChatGPT-like bots for your project or business to get things done. -
Worldtune
Wordtune is a Generative AI platform for work productivity. By using reliable GenAI, professionals from all fields can grow their careers and stand out at work. -
Motion
Use AI to plan your work, automatically. -
Audionotes
Speak or type, Audionotes will transform your notes into searchable clear actionable text notes using AI. -
NoteGPT
NoteGPT - YouTube Video Summarizer, PDF Summary, PPT Summary, Image Summaries, and more. Create PPTs, Mindmaps, and Notes with NoteGPT AI. Improve your learning efficiency by 10x. -
TurboSeek
Search smarter and faster with our open source AI search engine.
🎧 Text to Speech
-
LeptonAI
Lepton enables developers and enterprises to run AI applications efficiently in minutes, and at a production ready scale. -
MyShell
Democratizing & Decentralizing AI-native apps. MyShell is a decentralized and comprehensive platform for discovering, creating, and staking AI-native apps. -
Play.ht
Ultra realistic Text to Speech(TTS) voice. Leading AI Voice Generator. Free Unlimited downloads. Most Fluent & Conversational AI voices -
SoundHound AI
SoundHound’s Voice AI Technology Processes Speech Like the Human Brain. SoundHound’s proprietary technology was built to understand the complexity of speech and interpret meaning—just like the human brain. -
ChatTTS
ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. -
Sherpa Onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust. -
Bark Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects.
-
Fish Speech
Brand new TTS solution. -
Whisper Web
ML-powered speech recognition directly in your browser! Website
🎙️ Audio Generation
-
Stable Audio Tools
Training and inference code for audio generation models. -
Artist Voice Over The AI voice generator for video creators. Find your voice with the ultimate text-to-speech generator, featuring top voice actors exclusive to Artlist.
-
ElevenLabs
The ElevenLabs voice generator can deliver high-quality, human-like speech in 32 languages. Perfect for audiobooks, video voiceovers, commercials, and more.
🪄 Design & Creativity
-
runway
Tools for human imagination. A new suite of creative tools designed to turn the ideas in your head into reality. All made possible with AI models that can understand and generate worlds. -
stability.ai
Stability AI is the world’s leading open source generative AI company. We deliver breakthrough, open-access AI models with minimal resource requirements in imaging, language, code and audio.
Image Generator
-
Midjourney
Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species. -
DALL-E
DALL·E is an AI system that can create realistic images and art from a description in natural language. -
Microsoft Designer
A powerful and intuitive design tool that brings ideas to life with AI-powered design suggestions and one-of-a-kind AI-generated images from DALL-E. -
Stockimg AI
Stockimg is an all in one design and content creation tool powered by AI. You can easily generate logo, illustration, wallpaper, poster and more. -
getimg.ai
Easily generate images from text, edit photos with words, expand pictures beyond their borders, train custom AI models and much more. -
Ilus AI
Get beautiful, stylistically consistent illustrations in minutes. -
Omost
Omost is a project to convert LLM's coding capability to image generation (or more accurately, image composing) capability. -
Kolors
Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis. -
Looka
Use Looka's AI-powered platform to design a logo and brand you love. -
Flux AI
The best of FLUX.1, offering state-of-the-art performance image generation with top of the line prompt following, visual quality, image detail and output diversity. We are slowly ramping up our inference compute for FLUX.1 in our API. -
Krea
delightful creative tools with AI inside. -
LetsEnhance
Get clear, high-res images with AI. -
AI Emoji
Turn your ideas into emojis with AI Emoji Generator. Generate your favorite Slack or Discord emojis with just one click. -
Headshots AI
Elevate your online presence with HD headshots generated by our AI. Ideal for social profiles, resumes, and professional portfolios.
UI & UX
-
Diagram
Diagram is a design tools company reimagining UI design in the era of generative AI. We’re a small team of builders, creatives, and prototypers looking to grow individually and together. -
Studio Design
Go further than the speed of thought. STUDIO AI reads and understands your designs, and with nothing more than a single line of feedback, perform complex actions autonomously. -
uizard
Bring your product vision to life in minutes with the world's first AI-powered UI design tool - no design experience required! Sign up for free today. -
Galileo AI
Generate interface designs at lightning speed, Galileo AI is a UI generation platform for easy and fast design ideation. -
QoQo
QoQo.ai is an artificial intelligence to keep designers curious, organized and efficient. -
OpenUI
OpenUI aims to make the process fun, fast, and flexible. It's also a tool we're using at W&B to test and prototype our next generation tooling for building powerful applications on top of LLM's. -
THUMB ZONE
AI Copilot for Mobile User Testing. -
userevaluation
Seamlessly convert customer data into strategic assets using our all-encompassing AI analysis toolkit. -
Userology
Userology.co uses conversational AI to moderate usability testing sessions, helping you 10x the quality of insights much faster without human intervention. -
Thumb Zone
Eliminate the survival bias in your design decisions by looking into user persona, qualitative data, and user interaction. -
Open Canvas Open Canvas Chat UX by LangChain.
Viedo & Multi-Media
-
Runway
Runway is an applied AI research company shaping the next era of art, entertainment and human creativity. -
WOWBO
WOMBO.ai is a technology startup focused on delivering joy through consumer-focused AI experiences. We bring the power of computer vision, generative art and fun to the future of synthetic media. -
invideo AI
It generates a script, creates scenes, adds voiceovers, & tweaks the video at your command. With invideo AI as your co-pilot, engaging your audience is effortlessly simple! -
Synthesia
Create studio-quality videos with AI avatars and voiceovers in 130+ languages. It’s as easy as making a slide deck. -
FRAMEDROP
Framedrop is an AI tool that automatically finds the best moments in YouTube and Twitch content and turns it into short-form videos. This means no more long nights of going through your content and making TikToks/YouTube Shorts/Instagram Reels. Framedrop speeds up that process so that you can focus on what you love, creating content. -
Sora
Sora is an AI model that can create realistic and imaginative scenes from text instructions. -
Remini
Transformative technology gives your low-quality visuals a stunning HD upgrade. Restore old photos to incredible detail and elevate your content to a professional level. -
Novita
Explore the full spectrum of AI APIs tailored for image, video, audio, and LLM applications. Novita AI is designed to elevate your AI-driven business at the pace of technology, offering model hosting and training solutions. -
ToonCrafter
ToonCrafter can interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors. Please check our project page and paper for more information. -
MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation. -
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. -
FunClip
FunClip is a fully open-source, locally deployed automated video clipping tool. It leverages Alibaba TONGYI speech lab's open-source FunASR Paraformer series models to perform speech recognition on videos. -
Hallo
Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation. -
rapport
Rapport is an audio-driven facial animation technology company powered by our award-winning Speech Graphics engine. -
LivePortrait
Efficient Portrait Animation with Stitching and Retargeting Control. -
AI4Animation
Bringing Characters to Life with Computer Brains in Unity. -
invideoAI
Turn any idea into an attention-grabbing video instantly with invideo AI. -
Typeframes
Create videos for YouTube, Instagram, and TikTok with simple text prompts. Our AI-powered tool transforms text into stunning videos in minutes! Boost engagement, save money, and save time with Typeframes. -
Sapiens Sapiens offers a comprehensive suite for human-centric vision tasks (e.g., 2D pose, part segmentation, depth, normal, etc.). The model family is pretrained on 300 million in-the-wild human images and shows excellent generalization to unconstrained conditions. These models are also designed for extracting high-resolution features, having been natively trained at a 1024 x 1024 image resolution with a 16-pixel patch size.
-
Spotter Studio
Spotter Studio helps YouTube Creators create more winning videos through an integrated suite of ideation tools. -
HeyGen
Create and translate videos with AI. Produce studio-quality videos in 175 languages without a camera or crew.
Modeling & 3D
-
Genie
Genie allows anyone to generate realistic 3D models. The new tool can create 3D things from natural language prompts. -
Cap3D
Cap3D provides detailed descriptions of 3D objects by leveraging pretrained models in captioning, alignment, and LLM to consolidate multi-view information. -
InstantMesh
Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models -
Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image. -
3D AI Studio Generate 3D models, animations and textures in seconds. Drastically Reduce Time & Expense on 3D Models!
Icon
-
Magician
Design with the power of AI to do everything from copywriting to generating unique icons from text. -
Iconify AI
Create unique, beautiful and professional app icons with our AI icon generator. Convert text to ready to use logo easily just in seconds. -
IconMage
IconMage is an AI icon generator platform that allows you to create custom icons using a variety of styles and options. With IconMage, you can easily create icons for your website, app, or other projects. -
SVG.IO
Text prompt to SVG in less than 10 seconds.
Colors
-
Colormind
Generate color combinations in one click. Colormind creates cohesive color schemes using a deep neural net. -
Khroma
Khroma uses AI to learn which colors you like and creates limitless palettes for you to discover, search, and save.
Music & Sound
-
udio
Udio builds AI tools to enable the next generation of music creators. We believe AI has the potential to expand musical horizons and enable anyone to create extraordinary music. -
Suno
Suno is building a future where anyone can make great music. Whether you're a shower singer or a charting artist, we break barriers between you and the song you dream of making. No instrument needed, just imagination. From your mind to music.
🪛 Engineering
-
Leo
The world's first engineering design copilot powered by generative AI. If you can imagine it, I can design it, from parts to fully assembled products, with 3D CAD models you can edit anywhere. -
Globe
We are fixing engineering, using LLM Engineering Agents. And helping the LLM Agent community along the way. -
aino
AI-powered tool for urban consultants, data analysts, and spatial planners to gain insights from spatial data without requiring engineering skills. -
SnapMagic
Your AI Copilot for Electronics Design. Fly faster with your existing PCB design tool. Built on top of SnapMagic.
👾 Code & Development
Code Assistance
-
Github Copilot
GitHub Copilot can help you code by offering autocomplete-style suggestions. -
Deepnote AI
Deepnote offers contextual AI help for all your data projects. Learn more about the future of notebooks powered by AI assistance and deep learning. -
Codeium
A free AI powered toolkit for developers. -
Plandex
An open source, terminal-based AI coding engine that helps you complete large tasks, work around bad output, and maximize productivity. -
Replit AI
Automate the repetitive parts of coding, so you can stay focused on taking your idea to software. -
Sirji
Sirji is a Visual Studio Code extension that works as an AI software development agent. -
Mistral AI
Frontier AI in your hands. Open and portable generative AI for devs and businesses. -
kapa
Kapa.ai learns from your technical resources to generate an LLM-powered chatbot that answers developer questions automatically and helps you find gaps in your docs. -
Git
Assign tickets, get high-quality production code powered by AI agents and our developer community. -
CodeStory
CodeStory is an AI-powered mod of VSCode. Imagine a developer tasked with solving a bug, CodeStory can scan the codebase, identify the root cause, put up the fix and test the changes to ensure it doesn't repeat. -
Tempo Labs
Build Beautiful UIs 10X Faster, With AI. -
Codium
With CodiumAI, you get non-trivial tests (and trivial, too!) suggested right inside your IDE or Git platform, so you can code smart and stay confident when you push. -
Aider
Aider lets you pair program with LLMs, to edit code in your local git repository. -
amplication
Build new services, extend existing applications or modernize legacy systems with Amplication AI. Go from idea to production in minutes, with code that is built to scale. -
Tabine
Tabnine’s AI code assistant streamlines code generation and automates mundane tasks so developers can spend more time on the work they love. -
Melty
Melty is the first AI code editor that's aware of what you're doing from the terminal to GitHub, and collaborates with you to write production-ready code. -
Continue
Continue is the leading open-source AI code assistant inside of VS Code and JetBrains. -
GPT Pilot
GPT Pilot aims to research how much LLMs can be utilized to generate fully working, production-ready apps while the developer oversees the implementation.
Site & App Builder
-
V0.dev
v0 is a generative user interface system by Vercel powered by AI. It generates copy-and-paste friendly React code based on shadcn/ui and Tailwind CSS that people can use in their projects. -
Stunning
Websites. Build websites by typing in natural language. Allows user to customise the website ; Marketing. Generate Ad & Social Creatives with AI. -
Essai
Introducing AI no-code platform for building your websites in minutes, not hours. -
Glide
In Glide, building with AI is as simple as adding a column to a table. No need to manage prompts, choose models, deal with complex APIs, or cache results to optimize cost and performance –all of this is managed for you. -
MakeLanding
Write about your project and get an entire landing page generated in seconds! Copy that sells, unique logo and illustrations, beautiful icons. -
BuildAI
Build your own AI-powered web apps tailored to your business. No technical skills required. -
Locofy
Locofy.ai turns your designs into production-ready frontend code for mobile apps and web. It enables builders to ship products 10x faster with your existing design tools, tech stacks & workflows. -
Lazy
Build and modify web apps with prompts and deploy to the cloud with one click. -
Vercel AI
Build AI-powered applications with React, Svelte, Vue, and Solid. -
Builder.ai
Created a new category between low-code and custom software to provide flexible, bespoke apps at the speed and cost of an off-the-shelf product. -
Widgera
Widgera is a platform that empowers users to easily create personalized digital experiences. Its mission is to bring superpowers to digital presence creation, using Dynamic Interface Personalization to tailor content to user behavior, making every interaction unique and engaging. Ideal for SMEs, Widgera simplifies sophisticated web and app development. -
Wegic AI
Wegic is here to simplify the process of creating your ideal website through natural conversations. Just express your design preferences , adjustments, or tweaks, and Wegic will swiftly understand and cater to your needs. Moreover, Wegic allows you to effortlessly publish your site with a custom domain, making website creation as easy as chatting with a designer friend right beside you. -
UCRAFT
Create a website or online store in minutes with the intelligent eCommerce solution, Ucraft Next. -
bolt
Prompt, run, edit, and deploy full-stack web apps.s
Code Review
-
CodeRabbit
CodeRabbit is an innovative, AI-driven platform that transforms the way code reviews are done. Its automated reviews elevate the code quality while significantly reducing the time and effort tied to extensive manual code reviews. -
blockli
Blockoli is a high-performance tool for code indexing, embedding generation and semantic search tool for use with LLMs. blockoli is built in Rust and uses the ASTerisk crate for semantic code parsing. blockoli allows you to efficiently index, store, and search code blocks and their embeddings using vector similarity. -
Sourcegraph
Sourcegraph allows developers to rapidly search, write, and understand code by bringing insights from their entire codebase right into the editor. -
SWE Agent
SWE-agent turns LMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories.
Code editor
-
Curser
Built to make you extraordinarily productive, Cursor is the best way to code with AI.
📄 Resume & Career
-
AIApply
AIApply helps job seekers land more jobs quicker with the power of AI. Our Job Application Kit Generator is one of our tools that creates custom cover letters, rewrites resumes, and generates follow-up emails. -
Interview Warmup
A quick way to prepare for your next interview. Practice key questions, get insights about your answers, and get more comfortable interviewing. -
Final Round AI
Interview Copilot®️ generating actionable guidance in real-time. -
Interview with AI
Create a personalized interview preparation roadmap for the job description you pasted, see what you need to learn, solve quizzes, practice with AI like an actual online interview, and get a feedback at the end. -
Huntr
Huntr helps you create tailored resumes and cover letters fast with AI, fill out application forms in one click, and automatically organize your job search.
📊 Data & Monitoring
-
Browse AI
Extract data from any website and turn it into a spreadsheet or an API with No-Code. The easiest way to extract and monitor data from any website. Integration with 5000+ apps. -
Picarta
Predict where a photo has been taken in the world using Artificial Intelligence. -
Bardeen
Automation is now as easy as texting a friend, Meet Bardeen AI, delegate your tedious work with a few lines of text. -
HEX
Queries, notebooks, reports, data apps, and AI — all in the world’s leading collaborative data workspace. -
VortexAI Streamline your AI management with VortexAI's central hub for usage monitoring.
-
Deepnote
Deepnote offers contextual AI help for all your data projects. Learn more about the future of notebooks powered by AI assistance and deep learning. -
BrightData
Award-winning proxy networks, AI-powered web scrapers, and business-ready datasets for download. Welcome to the internet’s most trusted web data platform. -
Fluent
Fluent empowers your decision makers with AI: Ask data questions directly, get insights instantly. -
Sigma AI
Sigma AI is a global training data collection, preparation and annotation services company, specializing in Generative AI. We provide the highest quality training data at scale, with a human touch. -
Heron AI
Assess your impact, drive profit, and discover valuable insights automatically with AI.
👔 Business & Marketing
-
Ara
Autonomously predicts when audiences are ready to buy, bids on the most accurate inventory, and optimizes every campaign dollar. -
Sentione
Monitor online discussions that matter to your brand. By finding truly relevant insights with AI-based online listening and data analysis engine, you can fully manage your online brand image. -
NOAN
Develop, manage and deploy your company strategy with the AI operating system for business. -
Nexus
nexus enables anyone to build autonomous AI agents in minutes with no code. -
pipefy
Pipefy’s no-code platform is the fastest and most cost-effective way to boost your Procurement, HR and IT operations. -
Black Cow AI
Black Crow AI helps companies of all sizes improve profitability with the power of machine-learned prediction. We empower e-commerce brand growth by unlocking the hidden value in the customer data you already own. -
Midday
Midday provides you with greater insight into your business and automates the boring tasks, allowing you to focus on what you love to do instead. -
AdCreative
Give your business an unfair advantage with creatives / banners generated by highly trained Artificial Intelligence. -
Pecan
Pecan is a predictive analytics platform that leverages its pioneering Predictive GenAI to remove barriers to AI adoption, making predictive modeling accessible to all data and business teams. -
lang.ai
The AI for real-time CX insights tailored to your business complexity. Build trusted AI workflows beyond chatbots and make decisions backed with accurate, granular insights. -
Wordsmith
Make in-house legal accessible to the rest of the business and process workflows, documents and routine legal tasks in seconds. -
Yuma
Empowering Shopify merchants with automated AI customer service, boosting agent productivity while slashing costs. -
Workflow86
Automating complex business processes made effortless. -
FIELDGUIDE
The Fieldguide AI Platform for Advisory & Audit Firms saves time, increases margins, and improves client satisfaction. -
Flagright
Flagright is an AI-native, centralized, no-code compliance platform that transforms AML compliance and fraud prevention for fintechs and banks. -
Typewise
3X your customer service & sales productivity with our AI Communication Assistant. -
Khoros
Khoros connects every facet of customer engagement, including digital contact centers, messaging, chat, online brand communities, CX analytics, and social media management. Combined with our top-rated services, the Khoros platform enables brands to connect with customers throughout their entire digital journey. -
WiseWorld
An AI-simulated platform for soft skills assessment & development. WiseWorld helps employees practice, make mistakes, and improve their soft skills on a daily basis. -
Mendable
Train a secure AI on your technical resources that answers customer and employee questions so your team doesn't have to. -
YourGPT
Harness the Potential of Large Language Models (LLMs) for Business Innovation. -
Nory
Every ambitious restaurant business wants to open lots of venues. To do that successfully, you need consistency in operational standards and profitability across each venue. Nory is purpose built to help you achieve this. -
Beamup
BeamUP is the world's first Enterprise Facility Intelligence Platform to provide full facility observability and insights for data-driven decision making. The BeamUP platform is a global strategic solution that leverages unified, aggregated facility data and AI to empower physical security, IT and facilities management teams to make better decisions. -
ariglad
The first AI tool that auto-creates/updates your knowledge base from Zendesk, tickets, Slack, etc. -
Snorkel
Snorkel AI makes AI development fast and practical by transforming manual AI development processes into programmatic solutions. Snorkel AI enables enterprises to develop AI that works for their unique workloads using their proprietary data and knowledge 10-100x faster. -
windsor.ai
Connecting all marketing data with our marketing attribution software platform give marketers an increase in marketing ROI of 15-44%. Measure ROI for every channel, campaign, keyword and creative. -
Forecase
Forecast helps operational leaders plan, run and track projects in one place to optimize productivity and increase utilization. -
AdCreative
Generate ad creatives that outperform your competitors. -
Spot AI
Spot AI builds a modern AI camera system to create safer workplaces and smarter operations for every business. -
Placer.ai
Placer.ai is the most advanced foot traffic analytics platform allowing anyone with a stake in the physical world to instantly generate insights into any property for a deeper understanding of the factors that drive success. -
neocom
Neocom is to enable companies to understand and delight their customers with Guided Discovery. -
ChatBase
Build a custom GPT, embed it on your website and let it handle customer support, lead generation, engage with your users, and more.
📱 Social Media Tools
-
Postiz
Postiz offers everything you need to manage your social media posts, build an audience, capture leads, and grow your business.
💸 Crypto
-
wisdomise
Wisdomise is an AI powerhouse, driven to solve inefficiencies in web3 and decentralized economy. It aims at reconciling human and machine intelligence for the purpose of tokenizing the wisdom of crowds and democratizing DeFi for the masses.
🔍 OCR
-
Taggun Giving purpose to receipts with AI. Join the revolution of businesses transforming their receipts into powerful digital data.
-
Surya
Open-source OCR, layout analysis, reading order, line detection in 90+ languages. -
DataLab
We train AI models for OCR, layout analysis, PDF to markdown, and more. They're state of the art, easy to use, and open source.
🧮 Financial & Accounting
-
Brex AI
Reduce expense busywork by 10x while increasing compliance and accuracy. Brex AI eliminates manual work by automating tasks across our spend platform. -
FLYFIN
FlyFin is a consumer startup focused on automating the financial lives of US consumers. -
FinRobot
An Open-Source AI Agent Platform for Financial Applications using Large Language Models. -
FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 release the trained model on HuggingFace. -
OpenBB
OpenBB provides innovative, customizable, AI-powered solutions for analysts, quants, and other finance professionals. With a commitment to open-source development and user-centric design, OpenBB is redefining the status quo of investment research.
🔬 Research & Academic
-
Google Colab
Colab is a hosted Jupyter Notebook service that requires no setup to use and provides free access to computing resources, including GPUs and TPUs. Colab is especially well suited to machine learning, data science, and education. -
Explainpaper
Upload a paper, highlight confusing text, get an explanation. We make research papers easy to read.
🩺 Health & Wellbeing
-
Corti
Corti is a clinically proven AI guide that augments, automates, and analyzes virtual care and face-to-face patient engagements.
🗜️ Hardware
-
Friend
Open Source AI Wearable device that records everything you say, gives you proactive feedback and advice. 6+ days on single charge.
🧑🏫 Learning
-
LLM 101n
In this course we will build a Storyteller AI Large Language Model (LLM). Hand in hand, you'll be able create, refine and illustrate little stories with the AI. -
Generative AI for Beginners
18 Lessons teaching everything you need to know to start building Generative AI applications.
🔮 Open-source & Self-hosted
-
LibreChat
Enhanced ChatGPT Clone: Features OpenAI, GPT-4 Vision, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development. -
Upscayl
Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy. -
Burn
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals. -
Open Sora
An initiative dedicated to efficiently produce high-quality video and make the model, tools and contents accessible to all. By embracing open-source principles, Open-Sora not only democratizes access to advanced video generation techniques, but also offers a streamlined and user-friendly platform that simplifies the complexities of video production. With Open-Sora, we aim to inspire innovation, creativity, and inclusivity in the realm of content creation. -
Open Devin
An open-source project aiming to replicate Devin, an autonomous AI software engineer who is capable of executing complex engineering tasks and collaborating actively with users on software development projects. This project aspires to replicate, enhance, and innovate upon Devin through the power of the open-source community. -
FreeAskInternet
FreeAskInternet is a completely free, private and locally running search aggregator & answer generate using LLM, Without GPU needed. The user can ask a question and the system will use searxng to make a multi engine search and combine the search result to the ChatGPT3.5 LLM and generate the answer based on search results. -
Stable Diffusion web UI
A web interface for Stable Diffusion, implemented using Gradio library. -
Storm
STORM is a LLM system that writes Wikipedia-like articles from scratch based on Internet search. While the system cannot produce publication-ready articles that often require a significant number of edits, experienced Wikipedia editors have found it helpful in their pre-writing stage. -
whisper
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. -
whisper.cpp
High-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model. -
Devika
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika utilizes large language models, planning and reasoning algorithms, and web browsing abilities to intelligently develop software. -
Mini-Gemini
In this work, we introduce Mini-Gemini, a simple and effective framework enhancing multi-modality Vision Language Models (VLMs). Despite the advancements in VLMs facilitating basic visual dialog and reasoning, a performance gap persists compared to advanced models like GPT-4 and Gemini. -
Meta Llama 3
We are unlocking the power of large language models. Our latest version of Llama is now accessible to individuals, creators, researchers, and businesses of all sizes so that they can experiment, innovate, and scale their ideas responsibly. -
Jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM) -
dot
dot (aka Deepfake Offensive Toolkit) makes real-time, controllable deepfakes ready for virtual cameras injection. dot is created for performing penetration testing against e.g. identity verification and video conferencing systems, for the use by security analysts, Red Team members, and biometrics researchers. -
LLaMa Factory
Unify Efficient Fine-Tuning of 100+ LLMs. -
Open Interpreter
Open Interpreter lets LLMs run code (Python, Javascript, Shell, and more) locally. You can chat with Open Interpreter through a ChatGPT-like interface in your terminal by running interpreter after installing. -
Open Voice
Instant voice cloning by MyShell. -
Tensor Flow
TensorFlow makes it easy to create ML models that can run in any environment. Learn how to use the intuitive APIs through interactive code samples. -
Open Diffusion
Consistent Self-Attention for Long-Range Image and Video Generation. -
Ultimate Vocal Remover GUI
GUI for a Vocal Remover that uses Deep Neural Networks. -
llama3.np
llama3.np is pure NumPy implementation for Llama 3 model. -
Vila
VILA is a visual language model (VLM) pretrained with interleaved image-text data at scale, enabling video understanding and multi-image understanding capabilities. -
Khoj
Khoj is an application that creates always-available, personal AI agents for you to extend your capabilities. -
RAGapp
As simple to configure as OpenAI's custom GPTs, but deployable in your own cloud infrastructure using Docker. -
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation. -
Open Interpreter
Open Interpreter lets LLMs run code (Python, Javascript, Shell, and more) locally. You can chat with Open Interpreter through a ChatGPT-like interface in your terminal by running $ interpreter after installing. -
Perplexica
Perplexica is an open-source AI-powered searching tool or an AI-powered search engine that goes deep into the internet to find answers. Inspired by Perplexity AI, it's an open-source option that not just searches the web but understands your questions. -
Claude
Claude Engineer is an advanced interactive command-line interface (CLI) that harnesses the power of Anthropic's Claude 3 and Claude 3.5 models to assist with a wide range of software development tasks. -
Cake
Distributed LLM inference for mobile, desktop and server. -
Neo4J Graph
Neo4j graph construction from unstructured data using LLMs. -
ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models. -
Llama Models Llama is an accessible, open large language model (LLM) designed for developers, researchers, and businesses to build, experiment, and responsibly scale their generative AI ideas.
-
Llama Agentic
Agentic components of the Llama Stack APIs. -
Groq MOA
This Streamlit application showcases the Mixture of Agents (MOA) architecture proposed by Together AI, powered by Groq LLMs. -
Fooocus
Fooocus is an image generating software (based on Gradio). -
GPT4ALL
GPT4All runs large language models (LLMs) privately on everyday desktops & laptops. -
torchchat
torchchat is a small codebase showcasing the ability to run large language models (LLMs) seamlessly. With torchchat, you can run LLMs using Python, within your own (C/C++) application (desktop or server) and on iOS and Android. -
Deep Live Cam
Real time face swap and one-click video deepfake with only a single image. -
MetaGPT
Assign different roles to GPTs to form a collaborative entity for complex tasks. -
Inference Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models.
-
AI Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. 🧑🔬 -
Agent Zero
Agent Zero is not pre-programmed for specific tasks (but can be). It is meant to be a general-purpose personal assistant. Give it a task, and it will gather information, execute commands and code, cooperate with other agent instances, and do its best to accomplish it. -
Quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, …) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework. -
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops) -
roop
Take a video and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training. -
LLM.c
LLM training in simple, raw C/CUDA. -
HivisionIDPhotos
HivisionIDPhotos: a lightweight and efficient AI ID photos tools. -
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs. -
AWS AI Stack AWS AI Stack – A ready-to-use, full-stack boilerplate project for building serverless AI applications on AWS.
-
Depth Pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second. -
MaxKB
MaxKB = Max Knowledge Base,It is an open source knowledge base question and answer system based on the LLM large language model. It is widely used in enterprise internal knowledge bases, customer services, academic research and education and other scenarios. -
LightRAG
Simple and Fast Retrieval-Augmented Generation. -
WhisperKit
WhisperKit is a Swift package that integrates OpenAI's popular Whisper speech recognition model with Apple's CoreML framework for efficient, local inference on Apple devices.
📦 Models & LLMs
-
Qwen
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud. -
CLIP
CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a variety of (image, text) pairs. It can be instructed in natural language to predict the most relevant text snippet, given an image, without directly optimizing for the task, similarly to the zero-shot capabilities of GPT-2 and 3. -
Goose AI
GooseAI is a fully managed NLP-as-a-Service, delivered via API. It is comparable to OpenAI in this regard. And even more, it is fully compatible with OpenAI's completion API! -
vLLM
vLLM is a fast and easy-to-use library for LLM inference and serving. -
IREE
IREE (Intermediate Representation Execution Environment1) is an MLIR-based end-to-end compiler and runtime that lowers Machine Learning (ML) models to a unified IR that scales up to meet the needs of the datacenter and down to satisfy the constraints and special considerations of mobile and edge deployments.
📒 Libraries
-
Apple CoreNet
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation. -
Cognita
Langchain/LlamaIndex provide easy to use abstractions that can be used for quick experimentation and prototyping on jupyter notebooks. -
Copilot Kit
A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas. -
LSP-AI
LSP-AI is an open source language server that serves as a backend for performing completion with large language models and soon other AI powered functionality. Because it is a language server, it works with any editor that has LSP support. -
Mamba
Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. -
Meta Chamelon
A mixed-modal early-fusion foundation model from FAIR. -
Maestro
This Python script demonstrates an AI-assisted task breakdown and execution workflow using the Anthropic API. It utilizes two AI models, Opus and Haiku, to break down an objective into sub-tasks, execute each sub-task, and refine the results into a cohesive final output. -
Senabtic Kernel
Semantic Kernel is an SDK that integrates Large Language Models (LLMs) like OpenAI, Azure OpenAI, and Hugging Face with conventional programming languages like C#, Python, and Java. Semantic Kernel achieves this by allowing you to define plugins that can be chained together in just a few lines of code. -
Granite
IBM® Granite™ is a family of artificial intelligence (AI) models purpose-built for business, engineered from scratch to help ensure trust and scalability in AI-driven applications. Open source Granite models are available today. -
Watsonx
Readily build custom AI applications for your business, manage all data sources, and accelerate responsible AI workflows—all on one platform. -
candle
Candle is a minimalist ML framework for Rust with a focus on performance (including GPU support) and ease of use. -
Mem0
Mem0 provides an intelligent, adaptive memory layer for Large Language Models (LLMs), enhancing personalized AI experiences by retaining and utilizing contextual information across diverse applications. -
Haystack
Haystack is an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more. Whether you want to perform retrieval-augmented generation (RAG), document search, question answering or answer generation, Haystack can orchestrate state-of-the-art embedding models and LLMs into pipelines to build end-to-end NLP applications and solve your use case. -
LitServe
LitServe is an easy-to-use, flexible serving engine for AI models built on FastAPI. It augments FastAPI with features like batching, streaming, and GPU autoscaling eliminate the need to rebuild a FastAPI server per model. -
Dify
Dify is an open-source LLM app development platform. Orchestrate LLM apps from agents to complex AI workflows, with an RAG engine. More production-ready than LangChain. -
AnythingLLM
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more. -
All Hands
Use AI to tackle the toil in your backlog, so you can focus on what matters: hard problems, creative challenges, and over-engineering your dotfiles. -
Llama Stack
The Llama Stack defines and standardizes the building blocks needed to bring generative AI applications to market. These blocks span the entire development lifecycle: from model training and fine-tuning, through product evaluation, to building and running AI agents in production. -
exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ -
BISHENG
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more. -
RAGFlow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. When integrated with LLMs, it is capable of providing truthful question-answering capabilities, backed by well-founded citations from various complex formatted data. -
MindStudio
Rapidly build custom AI applications and automations — no coding required. Easily mix and match the latest models from OpenAI, Anthropic, Google, Mistral, Meta, and more. -
gradio
Gradio is the fastest way to demo your machine learning model with a friendly web interface so that anyone can use it, anywhere! -
Swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. -
bitnet.app
bitnet.cpp is the official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels, that support fast and lossless inference of 1.58-bit models on CPU (with NPU and GPU support coming next).
🛟 Hardware & Cloud
-
RunPod
Globally distributed GPU cloud built for production. Develop, train, and scale AI applications. -
Lightning
Code together. Prototype. Train. Deploy. Host AI web apps. From your browser - with zero setup. Lightning AI introduces a paradigm shift to AI development. Studio integrates your favorite ML tools into a single cohesive experience. It also eliminates the environment discrepancy between local code which runs on the cloud. This allows for trivial multi-node, scalable AI web apps, endpoints and more. -
Tasking AI
The developer-friendly cloud platform for building and running LLM agents for AI-native applications. -
golem
Golem Network is an open-source and decentralized platform where everyone can use and share each other's computing power without relying on centralized entities like cloud computing corporations.
🎮 Game & Entertainment
-
Campfire
Our first product is a game with LLM-based AI agents at the heart of the gameplay experience along with the tools to realize these worlds. -
DIAMOND
DIAMOND 💎 (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained entirely in a diffusion world model.
🚙 Automations & Agents
-
Langflow
Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.
⚙️ AI Tools
-
Google AI Studio
Google AI Studio is a browser-based IDE for prototyping with generative models. Google AI Studio lets you quickly try out models and experiment with different prompts. When you've built something you're happy with, you can export it to code in your preferred programming language, powered by the Gemini API. -
ENCORD
Encord is the leading data development platform for computer vision & multimodal AI teams. Intelligently manage, clean and curate data, streamline your labeling and workflow management, and evaluate model performance. -
Chat with Data
The Chat with your data Solution accelerator is a powerful tool that combines the capabilities of Azure AI Search and Large Language Models (LLMs) to create a conversational search experience. -
Chatbot
Take your chatbot beyond traditional boundaries with ChatBot Studio. Design dynamic flows to make your chatbot smarter and more responsive. -
Kong
Kong or Kong API Gateway is a cloud-native, platform-agnostic, scalable API Gateway distinguished for its high performance and extensibility via plugins. It also provides advanced AI capabilities with multi-LLM support. -
Llama FS
LlamaFS is a self-organizing file manager. It automatically renames and organizes your files based on their contents and well-known conventions (e.g., time). It supports many kinds of file, and even images (through Moondream) and audio (through Whisper). -
truefoundry
Build fast, secure and cost-efficient ML/LLM Apps. TrueFoundry takes care of the tricky details of production machine learning so you can focus on using ML to deliver value. -
PhotoPrism
PhotoPrism® is an AI-Powered Photos App for the Decentralized Web. It makes use of the latest technologies to tag and find pictures automatically without getting in your way. You can run it at home, on a private server, or in the cloud. -
Lightning AI
Code together. Prototype. Train. Deploy. Host AI web apps. From your browser - with zero setup. -
ComfyUI
The most powerful and modular stable diffusion GUI and backend. This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. -
MindsDB
MindsDB is the open-source orchestration platform connecting AI and enterprise data, helping developers customize their AI solutions. -
Chatpad
Not just another ChatGPT user-interface! -
Gumloop
Drag, drop, and deploy custom tools your business needs. Automate any workflow with AI. No AI expertise needed. -
Unstract No-code platform to eliminate manual processes involving unstructured data and document using the power of LLMs.
-
Trieve
All-in-one solution for building search, discovery, and RAG combining leading search language models + tools for tuning quality. -
FireCrawl
Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. -
kotaemon
An open-source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind. -
Taipy
From simple pilots to production-ready web applications in no time. No more compromise on performance, customization, and scalability. -
unriddle
Quickly find info in research papers, simplify complex topics, write with AI and keep everything organized. Save four hours on your next paper. -
Storm
STORM is a research prototype for automating the knowledge curation process. We now support human-in-the-loop! -
crawl4ai
Crawl4AI simplifies asynchronous web crawling and data extraction, making it accessible for large language models (LLMs) and AI applications. -
screenpipe
screenpipe records your screens and mics 24/7 safely on your computer, stores it locally, and connects it to AI. -
FlowiseAI
Open source low-code tool for developers to build customized LLM orchestration flow & AI agents. -
FastGPT
A free, open-source, and powerful AI knowledge base platform, offers out-of-the-box data processing, model invocation, RAG retrieval, and visual AI workflows. Easily build complex LLM applications. -
gptme
gptme is a tiny command-line application that allows you to interact with AI agent equipped with powerful local tools, acting as a copilot for your computer, via the terminal. It can execute python and bash, edit local files, search and browse the web. -
OpenRouter
A unified interface for LLMs. Find the best models & prices for your prompts. -
ultralytics
YOLO11 is built on cutting-edge advancements in deep learning and computer vision, offering unparalleled performance in terms of speed and accuracy. Its streamlined design makes it suitable for various applications and easily adaptable to different hardware platforms, from edge devices to cloud APIs. -
Deep Live Cam
Real-time face swap and video deepfake with a single click and only a single image.
🧰 Others
-
Chatgot
Chat Freely, Got Every AI Assistants Here for You. -
Future Tools
FutureTools collects & organizes all the best AI tools so you too can become superhuman! -
WizyChat
Instant answers for your customers and your team with personalized AI chatbots trained with your data. -
Paperspace
Paperspace, now part of DigitalOcean, is a high-performance cloud computing and ML development platform for building, training and deploying machine learning models. -
unify
The Best LLM on Every Prompt. Not Sure Which Model to Use? Automatically Use The Best Model for Your Task on Every Prompt. -
Poe
Talk to ChatGPT, GPT-4, Claude 3 Opus, DALLE 3, and millions of others - all on Poe. -
CommandBar
The only platform for non-annoying user assistance, helping product, marketing, and customer teams unleash their users. -
FIXIE
Human communication is messy. We interrupt, talk over each other, and don't always wait our turn. But this rapid, messy exchange of ideas serves as the backbone of human progress. LLMs are revolutionary, but their potential impact is currently limited to situations where text-based chat is sufficient. -
hume Empathic AI to serve human well-being. With a single API call, interpret emotional expressions and generate empathic responses. Meet the first AI with emotional intelligence.
-
phidata
Phidata is a framework for building AI Assistants with memory, knowledge and tools. -
MapGPT
Deliver natural conversations with a location-intelligent AI assistant. -
QuillBot
QuillBot is an AI-powered writing platform helping more than 35 million monthly active users across 150 countries. With its innovative human-in-the-loop products, QuillBot aims to make writing painless while preserving the user's unique perspective and voice. -
wonderchat
Instantly build AI chatbots from your knowledge base. -
Toko
Learn English by speaking with an AI. Practice realistic conversations. Get instant grammar feedback. Anytime, anywhere. -
Cove
Cove is a wide open space where you can think together with AI. -
MiniCPM
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone. -
Scrapegraph-ai
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). -
StreamV2V
Our StreamV2V could perform real-time video-2-video translation on one RTX 4090 GPU. -
MatMul
MatMul-Free LM is a language model architecture that eliminates the need for Matrix Multiplication (MatMul) operations. This repository provides an implementation of MatMul-Free LM that is compatible with the 🤗 Transformers library. -
Spreeedsheet nanoGPT
A nanoGPT pipeline packed in a spreadsheet. This is a project that I did to help myself understand how GPT works. It is pretty fun to play with, especially when you are trying to figure out what exactly is going on inside a transformer. This helped me to visualize the entire structure and the data flow. All the mechanisms, calculations, matrices inside are fully interactive and configurable. -
Depth Anything V2
This work presents Depth Anything V2. It significantly outperforms V1 in fine-grained details and robustness. Compared with SD-based models, it enjoys faster inference speed, fewer parameters, and higher depth accuracy. -
Micrograd
A tiny Autograd engine (with a bite! :)). Implements backpropagation (reverse-mode autodiff) over a dynamically built DAG and a small neural networks library on top of it with a PyTorch-like API. -
DiffSynth
DiffSynth Studio is a Diffusion engine. We have restructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performance. -
LOOKWISE
Where Fashion Meets Future! -
Wanderboat
Wanderboat AI builds technologies that leverage deep search and generative AI to help people reconnect with the world. Explore, plan and share, your personalized AI companion starts from here! -
PetSpotR
PetSpotR allows you to use advanced AI models to report and find lost pets. It is a sample application that uses Azure Machine Learning to train a model to detect pets in images. -
GraphRAG
The GraphRAG project is a data pipeline and transformation suite that is designed to extract meaningful, structured data from unstructured text using the power of LLMs. -
ARC-AGI
ARC can be seen as a general artificial intelligence benchmark, as a program synthesis benchmark, or as a psychometric intelligence test. It is targeted at both humans and artificially intelligent systems that aim at emulating a human-like form of general fluid intelligence. -
PDF Extract Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction. -
llm colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM. -
GPT Prompt Engineer
Prompt engineering is kind of like alchemy. There's no clear way to predict what will work best. It's all about experimenting until you find the right prompt. gpt-prompt-engineer is a tool that takes this experimentation to a whole new level. -
Jina
Jina AI is a leading search AI company. We provide the Search Foundation, the core for GenAI and multimodal applications. -
Reply Guy
ReplyGuy finds the perfect conversations to mention your product and drafts suggested replies. -
Composio
Empower your AI agents with Composio - a platform for managing and integrating tools with LLMs & AI agents using Function Calling. -
Prompt Engineering Guide
Prompt engineering is a relatively new discipline for developing and optimizing prompts to efficiently use language models (LMs) for a wide variety of applications and research topics. Prompt engineering skills help to better understand the capabilities and limitations of large language models (LLMs). -
LiveKit
Build realtime AI. Instantly transport audio and video between LLMs and your users.
📋 Directories & Communities
-
THERE'S AN AI FOR THAT
There is an ai for that list. -
Hugging Face
The AI community building the future. The platform where the machine learning community collaborates on models, datasets, and applications.
Contribution
The list is not complete also with the speed of the AI industry, there is a lot to add so feel free to contribute and help develop the atlas.
Coming Soon:
- Pricing
- Tools Description
- Usage Detail
Created by Mosn
© 2024 • Contents under MIT License • Credits