Skip to content
Change the repository type filter

All

    Repositories list

    • This action simplify creating of release PR
      JavaScript
      Apache License 2.0
      0000Updated Nov 27, 2024Nov 27, 2024
    • Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      Apache License 2.0
      3204.7k819Updated Nov 27, 2024Nov 27, 2024
    • This project is the home of Apify's documentation.
      API Blueprint
      Apache License 2.0
      76297025Updated Nov 27, 2024Nov 27, 2024
    • The Github action that makes sure that each PR is correctly set up and has a milestone set.
      TypeScript
      Apache License 2.0
      1111Updated Nov 27, 2024Nov 27, 2024
    • RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
      TypeScript
      Apache License 2.0
      0520Updated Nov 27, 2024Nov 27, 2024
    • Apify API client for JavaScript / Node.js.
      TypeScript
      Apache License 2.0
      2768165Updated Nov 27, 2024Nov 27, 2024
    • openapi

      Public
      An OpenAPI specification for the Apify API.
      JavaScript
      MIT License
      12173Updated Nov 27, 2024Nov 27, 2024
    • This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
      Apache License 2.0
      0274Updated Nov 27, 2024Nov 27, 2024
    • Apify API client for Python
      Python
      Apache License 2.0
      125089Updated Nov 27, 2024Nov 27, 2024
    • Apify SDK monorepo
      TypeScript
      Apache License 2.0
      35124117Updated Nov 27, 2024Nov 27, 2024
    • Utilities and constants shared across Apify projects.
      TypeScript
      Apache License 2.0
      111250Updated Nov 27, 2024Nov 27, 2024
    • apify-cli

      Public
      Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
      TypeScript
      19122362Updated Nov 27, 2024Nov 27, 2024
    • crawlee

      Public
      Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      Apache License 2.0
      68216k11814Updated Nov 27, 2024Nov 27, 2024
    • workflows

      Public
      Apify's reusable github workflows
      Python
      4744Updated Nov 26, 2024Nov 26, 2024
    • Apify integration for Zapier
      JavaScript
      Apache License 2.0
      1841Updated Nov 26, 2024Nov 26, 2024
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      Apache License 2.0
      1041k2012Updated Nov 25, 2024Nov 25, 2024
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      Apache License 2.0
      10120130Updated Nov 25, 2024Nov 25, 2024
    • This is the experimental version of Web Automation Agent. The agent uses natural language instructions to browse the web and extract data.
      TypeScript
      Apache License 2.0
      202892Updated Nov 23, 2024Nov 23, 2024
    • HTTP client made for scraping based on got.
      TypeScript
      44558151Updated Nov 20, 2024Nov 20, 2024
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      26000Updated Nov 20, 2024Nov 20, 2024
    • Apify ESLint preset to be shared between projects
      JavaScript
      Apache License 2.0
      0210Updated Nov 18, 2024Nov 18, 2024
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      Apache License 2.0
      145851812Updated Nov 17, 2024Nov 17, 2024
    • Scrape list of available integrations from Make
      TypeScript
      0001Updated Nov 15, 2024Nov 15, 2024
    • Scrape list of Zapier integrations from Zapier website
      TypeScript
      0001Updated Nov 15, 2024Nov 15, 2024
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      Apache License 2.0
      4410Updated Nov 14, 2024Nov 14, 2024
    • Python
      Apache License 2.0
      1300Updated Nov 13, 2024Nov 13, 2024
    • Base Docker images for Apify actors.
      Dockerfile
      Apache License 2.0
      227093Updated Nov 8, 2024Nov 8, 2024
    • This tool integrates with AWS to monitor service usage costs and posts a summary of these costs to a Slack channel. The summary includes costs for various AWS services along with a chart that provides a visual breakdown of the costs over time.
      TypeScript
      MIT License
      0001Updated Nov 5, 2024Nov 5, 2024
    • This project is the 🏠 home of Apify actor template projects to help users quickly get started.
      Python
      192681Updated Oct 25, 2024Oct 25, 2024
    • A Homebrew tap for Apify tools
      Ruby
      1804Updated Oct 25, 2024Oct 25, 2024