
Backend Engineer
Finrep is seeking a talented Data Engineer to own the data infrastructure behind our core AI-powered SEC financial reporting product. The role covers the ingestion and parsing of SEC filings, XBRL taxonomies, and financial disclosure documents, along with ETL orchestration, search indexing, and cross-system data synchronization.
You will be working with complex, semi-structured financial data, moving beyond traditional relational databases to build robust, reliable, and idempotent data pipelines. As a founding-level engineer, you will have a significant impact on shaping the future scalability and architecture of Finrep's data layer.
Key Responsibilities:
- Develop and maintain backend services utilizing Python, Django, and Django REST Framework.
- Design and manage ETL pipelines using Prefect, deployed on GCP Cloud Run.
- Build robust ingestion and parsing pipelines for SEC 10-Q/10-K filings, XBRL taxonomies, and various disclosure documents.
- Construct and optimize search infrastructure using OpenSearch, focusing on indexing strategies, relevance tuning, and query performance.
- Design and implement Change Data Capture (CDC) pipelines for seamless data synchronization between PostgreSQL and OpenSearch.
- Leverage Pub/Sub and Celery for asynchronous processing, background job management, and task orchestration.
- Enhance observability, improve system reliability, and optimize costs across the entire data infrastructure.
Qualifications:
- 2 to 4 years of professional backend engineering experience, with a strong focus on Python.
- Proven experience with Django and Django REST Framework.
- Solid understanding of PostgreSQL, including schema design, query optimization, and indexing techniques.
- Experience in building ETL pipelines, workflow orchestration, or background job processing systems.
- Familiarity with concepts like retries, dead-letter queues, idempotency, and failure handling in asynchronous systems.
- Experience with cloud platforms like GCP or AWS, particularly services such as Cloud Run or Cloud Functions.
- Comfort working with semi-structured or document-oriented data formats (e.g., XML, HTML, nested structures); this is essential for the role.
- Strong debugging skills and a proactive, ownership-driven approach to problem-solving.
Ideal Candidate Profile (Bonus Points):
- Prior experience with OpenSearch or Elasticsearch.
- Hands-on experience with Prefect for workflow orchestration.
- Experience with CDC or event-driven data synchronization pipelines.
- Familiarity with SEC filings, XBRL, or financial reporting data domains.
- Exposure to working with agentic systems.
Technology Stack: Python, Django, DRF, PostgreSQL, OpenSearch, Prefect, GCP Cloud Run, Pub/Sub, Celery, Docker.
This is a unique opportunity for a founding-level engineer to gain significant ownership of data infrastructure at an AI company tackling a complex and underserved problem in public company financial reporting. The challenging domain, non-trivial data, and broad scope ensure a continuous stream of interesting and impactful problems to solve.
Company
Finrep AI
Finrep is an innovative AI-powered platform revolutionizing SEC financial reporting for public companies. We empower businesses to streamline the preparation, review, and filing of critical disclosure...