Blog
-
LLama Agents by LLamaIndex
Let’s take the common pattern of agents interacting with tools, and turning them into microservices. In llama-agents we allow you to setup both an agent service, which takes in a user input and performs reasoning about the next task to solve, and also a tool service, which can execute any variety of tools and exposes…
-
Building a Basic AI Agent
In LlamaIndex, an agent is a semi-autonomous piece of software powered by an LLM that is given a task and executes a series of steps towards solving that task. It is given a set of tools, which can be anything from arbitrary functions up to full LlamaIndex query engines, and it selects the best available…
-
LLama-3 vs Phi-3
In the ongoing quest to create the most powerful and efficient AI models, a new champion has emerged from the ranks of the small guys. Microsoft’s Phi-3, a tiny AI model by industry standards, has been racking up impressive wins, including surpassing Meta’s much larger LLama-3 model in key benchmarks. This development suggests a shift…
-
LlamaFS
Today’s highlight is LlamaFS – a self-organizing file manager. Given a directory of messy files (e.g. your ~/Downloads directory), it will automatically and reorganize the entire set of files into an organized directory structure with interpretable names. It can “watch” your directory and intercept all FS operations to proactively learn how you rename files. It’s…
-
LLama 3 – 400B
The AI research community has been abuzz with the recent release of Llama 3, a series of language models that promise to revolutionize the field of natural language processing. The initial release of Llama 3 8B and 70B models marked the beginning of an exciting journey, with even more impressive models on the horizon. In…
-
LlamaIndex on Vertex AI
LlamaIndex Team excited to partner with the Vertex AI team (@googlecloud) to feature a brand-new RAG API on Vertex, powered by @llama_index advanced modules that enable e2e indexing, embedding, retrieval, and generation. It is simultaneously easy to setup and use, while providing developers programmatic flexibility to connect a range of data sources (local, GCS, GDrive)…
-
Building JavaScript agents in LlamaIndex.TS
The ultimate guide to building agents in TypeScript is here! This guide takes you step-by-step through: What is an Agent? In LlamaIndex, an agent is a semi-autonomous piece of software powered by an LLM that is given a task and executes a series of steps towards solving that task. It is given a set of…
-
Optimizing RAG with LLamaIndex
A cool trick you can use to improve retrieval performance in your RAG pipelines is fine-tune the embedding model (bi-encoder) based on labels from a cross-encoder 💡 Cross-encoders are crucial for reranking but are way too slow for retrieving over large numbers of documents. This fine-tuning technique gives you all the speed advantages of direct…