StabRise
ProjectsBlogAbout
All Posts
  • spark (5)
  • spark-pdf (5)
  • databricks (2)
  • data-extraction (1)
  • ai (1)
  • scaledp (1)
  • llm (1)
  • spark-connect (1)
  • Spark PDF with Spark Connect

    sparkspark-pdfspark-connect

    This blog post introduces Spark PDF, a custom data source for Apache Spark that empowers users to seamlessly integrate PDF data into their Spark workflows.

    Mykola Melnyk
    Mykola Melnyk

    Machine Learning & Data Processing Expert

    March 6, 2025

Document Processing Solutions

Scalable by the Spark. Process structured and unstructured data with ease.

Project
Spark PDFScaleDPPDF RedactionDe-identify
Legal
Privacy PolicyTerms of ServiceBlog
Contact Us

Wilcza 19i, lok. 2, Marki, Mazovezkie, Poland, 05-270

info@stabrise.com
+48-790-844-156
mailMailgithubGitHublinkedinLinkedin
© 2025
•
StabRise: Scalable AI-Powered Document Processing Solutions.