Ingests CSV data from the GitHub repository https://github.com/decoderleco/deces_europe

  • Downloads the CSV dataset file
  • Sends the CSV dataset file to Kafka (publishes it to a Kafka topic)
  • Persists the CSV dataset file to an S3 bucket (far more convenient than using the local filesystem)
  • Adds the S3 bucket (containing the CSV dataset file) to a Git-like branch in lakeFS
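
The four steps above can be sketched as a small orchestration function. This is only an illustration of the ordering, not the class's actual implementation: the `IngestionPorts` interface, the `ingestCsv` function, and every parameter name below are hypothetical, and the Kafka, S3, and lakeFS clients are deliberately hidden behind injected functions so that no real client API is assumed.

```typescript
// Hypothetical ports: none of these names come from the documented class.
// Each one stands in for a real client (HTTP, Kafka, S3, lakeFS).
interface IngestionPorts {
  download(url: string): Promise<string>;
  publish(topic: string, payload: string): Promise<void>;
  putObject(bucket: string, key: string, body: string): Promise<void>;
  commitToBranch(branch: string, key: string): Promise<void>;
}

// Hypothetical settings for one ingestion run.
interface IngestionTarget {
  topic: string;
  bucket: string;
  key: string;
  branch: string;
}

// Runs the four documented steps in order and returns the names of the
// steps that completed, so a caller can log or assert on the sequence.
async function ingestCsv(
  url: string,
  target: IngestionTarget,
  ports: IngestionPorts
): Promise<string[]> {
  const done: string[] = [];

  // 1. Download the CSV dataset file.
  const csv = await ports.download(url);
  done.push("download");

  // 2. Publish the CSV content to a Kafka topic.
  await ports.publish(target.topic, csv);
  done.push("publish");

  // 3. Persist the CSV file to an S3 bucket.
  await ports.putObject(target.bucket, target.key, csv);
  done.push("persist");

  // 4. Record the stored object on a lakeFS branch.
  await ports.commitToBranch(target.branch, target.key);
  done.push("version");

  return done;
}
```

Injecting the ports keeps the sequence testable with in-memory fakes; a real deployment would supply implementations backed by whichever Kafka, S3, and lakeFS clients the project uses.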

Constructors

Properties

dataWorkDir?: string
filePathInRepo: string
gitVersion: string
baseUrl: string = "https://raw.githubusercontent.com/decoderleco/deces_europe"
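
A minimal sketch of how these properties might combine into a download URL, assuming the usual raw.githubusercontent.com layout of `<owner>/<repo>/<ref>/<path>`. The `CsvSourceProps` interface, the `buildRawUrl` helper, and the sample values are hypothetical; the class itself may assemble its URL differently.

```typescript
// Hypothetical shape mirroring the documented properties.
interface CsvSourceProps {
  dataWorkDir?: string;
  filePathInRepo: string;
  gitVersion: string;
  baseUrl: string;
}

// Hypothetical helper: joins baseUrl, gitVersion and filePathInRepo,
// trimming stray slashes so the segments do not double up.
function buildRawUrl(props: CsvSourceProps): string {
  const base = props.baseUrl.replace(/\/+$/, "");
  const ref = props.gitVersion.replace(/^\/+|\/+$/g, "");
  const path = props.filePathInRepo.replace(/^\/+/, "");
  return `${base}/${ref}/${path}`;
}

const exampleUrl = buildRawUrl({
  baseUrl: "https://raw.githubusercontent.com/decoderleco/deces_europe",
  gitVersion: "main", // assumed branch name
  filePathInRepo: "data/example.csv", // placeholder path, not a real file
});
console.log(exampleUrl);
```

Here `gitVersion` is treated as any Git ref (branch, tag, or commit SHA), which is what the raw-content URL scheme accepts in that position.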

Methods
