
Enterprise-Grade Single-Step Streaming Data Infrastructure Setup (Under Development)



Share your use case: Google Form. It will help shape the initial features.


Read more about it in my blog post on Towards Data Science: https://tinyurl.com/yyqr79dh

Oesophagus Ecosystem

Oesophagus enables you to deploy an entirely plug-and-play data infrastructure to advance your organisation's data capability.

The architecture consists of:

  • Data Producers: services that fetch data from relational databases, third-party APIs, and other sources.
  • Stream Processors: Kafka Streams, KSQL, and the like (see the sketch below).
  • Data Consumers: services that load data into columnar or document-oriented databases, search indices, or other downstream databases and services.
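
As a flavour of the stream-processing layer, the snippet below defines a KSQL stream over a Kafka topic from the ksqlDB CLI. The topic name, its columns, and the server address are assumptions made up for this illustration, not part of this repository.

# Illustrative only: define a stream over an existing Kafka topic via the ksqlDB CLI
# (the topic 'users', its columns, and the server address are assumed for this example)
$ ksql http://localhost:8088
ksql> CREATE STREAM users_stream (id INT, name VARCHAR) WITH (KAFKA_TOPIC='users', VALUE_FORMAT='JSON');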

Data Pipeline

Example: Postgres to Elasticsearch Real-Time ETL Setup:

Requirements
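
  • Docker
  • Docker Compose (the whole stack below is started with a single docker-compose command)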

Deployment

# Start kafka, connect, schema-registry, ksqldb, ksqlcli, postgres, elasticsearch and automation-scripts
$ docker-compose up -d
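
Once the stack is up, a quick way to verify that every container started (container names follow docker-compose.yml):

# Check that all services are running
$ docker-compose ps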

Testing Services

# GET request on the Elasticsearch server to test availability
$ curl -f 'localhost:9200'

# Search all indices in Elasticsearch
$ curl -f 'localhost:9200/_search'
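
The other services expose HTTP endpoints that can be probed the same way. The ports below are the Confluent defaults and are assumptions here; docker-compose.yml may map them differently:

# List connectors registered with Kafka Connect (default REST port 8083)
$ curl -f 'localhost:8083/connectors'

# List subjects registered in the Schema Registry (default port 8081)
$ curl -f 'localhost:8081/subjects'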

Why use Oesophagus's Postgres CDC?

The Oesophagus Postgres CDC Producer is built to Extract, Transform, and Load relational databases' data into downstream databases and services.

It uses the Change Data Capture (CDC) pattern to read changes from the WAL (Write-Ahead Log) of the source database.

Change Data Capture (CDC), as its name suggests, is a Database Design Pattern that captures individual data changes instead of dealing with the entire data. Instead of dumping your entire database, using CDC, you would capture just the data changes made to the master database and apply them to the BI databases to keep both of your databases in sync. This is much more scalable because it only deals with data changes. Also, the replication can be done much faster, often in near real-time.

Information Source: FlyData
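
For a concrete picture of what a CDC producer reads, here is a minimal sketch using plain psql against a Postgres instance that already has wal2json installed. The slot name and table are made up for the example:

# Illustrative only: decode WAL changes with wal2json from psql
$ psql -U postgres -c "SELECT * FROM pg_create_logical_replication_slot('demo_slot', 'wal2json');"
$ psql -U postgres -c "INSERT INTO users (name) VALUES ('alice');"
$ psql -U postgres -c "SELECT data FROM pg_logical_slot_get_changes('demo_slot', NULL, NULL);"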

Functionality

Note: Before starting the service, the wal2json plugin should be installed in your Postgres container so that database logs can be read.
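
A minimal sketch of installing the plugin, assuming the official Debian-based Postgres 12 image and a container named postgres (both assumptions; the package name tracks your Postgres major version):

# Install wal2json inside the running Postgres container
$ docker exec -it postgres bash -c "apt-get update && apt-get install -y postgresql-12-wal2json"

# Logical decoding must also be enabled in postgresql.conf (restart required):
#   wal_level = logical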

  1. As the service starts, it first performs a Full Table Replication for every table listed as a key in producer.json (see the sketch after this list).
  2. Once the Full Table Replication completes, the service starts listening to the database logs through the replication slot that was created automatically before the replication began.
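
The exact schema of producer.json is not shown in this README, so the shape below is purely illustrative; only the idea that table names appear as keys comes from the step above:

# Hypothetical producer.json; real key names may differ
$ cat producer.json
{
  "users": {},
  "orders": {}
}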