-
Notifications
You must be signed in to change notification settings - Fork 5
Home
Collection of services and tools for creating, processing and storing streams of MySQL data changes.
Testing, beta-level quality.
Flushes MySQL database tables to the binlog in order to have the initial snapshot of the database in the binlog.
Replicates data changes from MySQL binlog to HBase, preserving the previous data versions. Intended for auditing purposes of historical data. In addition can maintain special daily-changes tables which are convenient for fast and cheap imports from HBase to Hive.
HBaseSnapshotter is a Spark application that takes a snapshot of an HBase table at a given point in time and store it to a Hive table. Usually you can export from HBase to Hive but you can only get the latest version, as Hive doesn't have enough flexibility to access different versions of an HBase table. Spark framework allows this flexibility since it has the ability and the API to access and manipulate both HBase and Hive.