2024 Q4 Most Exciting LLM-powered Projects

Following up on the list of 2023’s, Q1 2024’s, Q2 2024’s, Q3 2024’s most exciting projects, here is the list of the most useful, innovative and exciting LLM-powered projects I found in 2024 Q4.

Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper 🕷️

Crawl4AI (11K stars) is a powerful, open-source Python asynchronous web crawling and data extraction library designed for large language models (LLMs) and AI applications. It offers blazing-fast performance, outperforming many paid services, while providing LLM-friendly output formats like JSON, cleaned HTML, and markdown.

Key features include:

  • multi-URL crawling
  • media tag extraction
  • custom hooks for authentication and page modifications
  • user-agent customization
  • screenshot capture
  • various chunking and extraction strategies

Crawl4AI excels in complex scenarios like session management and dynamic content crawling, making it ideal for tasks such as analyzing GitHub commits across multiple pages.

The library is particularly useful for developers working on AI-driven web scraping projects, offering a free alternative to paid services with comparable or better performance.

Users can try out Crawl4AI using the provided Colab notebook or explore its capabilities through the comprehensive documentation.

Firecrawl: Turns websites into LLM-ready markdown 🔥

Firecrawl is a powerful web crawling and data extraction API service designed for AI applications. It offers advanced scraping, crawling, and structured data extraction capabilities, converting web content into clean markdown or structured data formats ideal for large language models (LLMs).

Key features include multi-URL crawling, LLM-ready output formats, proxy support, anti-bot handling, and customizable extraction options. Firecrawl excels in reliability and customizability, offering features like custom headers for authentication, PDF and image parsing, and interactive page actions. It’s particularly useful for developers building AI-powered web scraping projects, offering both a cloud-based API service and open-source options.

Users can easily try Firecrawl through its playground or explore its capabilities via the comprehensive documentation. The project also provides SDKs for multiple programming languages and integrations with popular LLM frameworks, making it a versatile tool for various web data extraction needs.

AI Hawk: Auto jobs applier

Auto_Jobs_Applier_AIHawk is a beta version AI-powered job search assistant that automates the job application process. It offers features like intelligent job search automation, rapid application submission, AI-powered personalization for resumes and cover letters, and bulk application capabilities with quality control measures.

The tool is designed to streamline the job hunting process by automatically searching for relevant positions, filling out application forms, and even generating tailored resumes for each application. It supports various LLM models including OpenAI’s GPT, Ollama, Claude, and Gemini, allowing users to customize their experience. The project includes detailed configuration options for job search parameters, resume information, and LLM settings. It is particularly useful for job seekers looking to efficiently apply to multiple positions while maintaining personalization in their applications.

Users can try out the tool by following the installation instructions and configuring their job search preferences in the provided YAML files. The project’s GitHub repository offers comprehensive documentation, troubleshooting guides, and community support for users getting started with this automated job application tool. It’s also an interesting tool to study if you are working on agentic solutions.

Trigger.dev: Long-running background jobs without ⌛

Trigger.dev is an open-source platform and SDK for creating long-running background jobs without timeouts. It allows developers to write normal async code in JavaScript or TypeScript, deploy it, and never hit a timeout.

Key features include reliability by default, no infrastructure management, and compatibility with existing tech stacks. Trigger.dev integrates directly into your codebase, allowing for version control, local development, testing, and code review using familiar processes. The platform supports multiple environments (Development, Staging, Production) and provides full visibility of every job run through a detailed trace view. Trigger.dev offers both cloud-based and self-hosted options, making it flexible for various deployment needs.

Gateway: Blazing Fast AI Gateway 🤝🏼

Portkey AI Gateway is an open-source platform that provides a unified API for routing requests to over 200 language, vision, audio, and image models from various providers. It offers production-ready features such as caching, fallbacks, retries, timeouts, and load balancing, with the ability to be edge-deployed for minimal latency.

Key features include blazing fast performance (9.9x faster than direct API calls), a tiny footprint (about 100kb build), load balancing across multiple models and providers, automatic retries, configurable timeouts, and support for multimodal AI tasks. The gateway is compatible with OpenAI API and SDKs, making it easy to integrate into existing projects. It supports multiple deployment options, including a hosted version, self-hosted open-source version, and an enterprise version with additional security and management features.

The project is actively maintained, has processed over 480 billion tokens, and is used by companies like Postman, Haptik, and Turing. Users can get started quickly with the hosted API, or deploy the open-source version using npm or other deployment methods. The AI Gateway also integrates well with popular agent frameworks and offers extensive documentation and community support for developers.

More updates coming soon

Revisit this article for more updates and let me know in the comments if you think I missed anything interesting.

Panagiotis

Written By

Panagiotis (pronounced Panayotis) is a passionate G(r)eek with experience in digital analytics projects and website implementation. Fan of clear and effective processes, automation of tasks and problem-solving technical hacks. Hands-on experience with projects ranging from small to enterprise-level companies, starting from the communication with the customers and ending with the transformation of business requirements to the final deliverable.