d6tjoin - Identify and analyze join problems
Joining datasets is a common data engineering operation. However, merging datasets from different sources often runs into problems caused by mismatched identifiers, differing date conventions, and similar inconsistencies.
The d6tjoin.utils module lets you test join accuracy and quickly identify and analyze join problems.
The examples below show how to:
- do join quality analysis prior to attempting a join
- detect and analyze a mismatch in string-based identifiers
- detect and analyze a date mismatch
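
To make the idea of join quality analysis concrete, here is a minimal sketch in plain pandas. It is not the d6tjoin API: the helper `key_overlap` and the sample frames `df_left` and `df_right` are hypothetical names invented for illustration. It only shows the kind of check one might run before attempting a join, namely comparing the distinct key values on each side.

```python
import pandas as pd


def key_overlap(left, right, key):
    """Hypothetical helper (not part of d6tjoin): compare distinct key values."""
    left_keys = set(left[key].dropna().unique())
    right_keys = set(right[key].dropna().unique())
    matched = left_keys & right_keys
    return {
        # keys that would be dropped by an inner join
        "left_only": sorted(left_keys - right_keys),
        "right_only": sorted(right_keys - left_keys),
        "matched": len(matched),
        # share of distinct left keys that find a partner on the right
        "left_match_rate": len(matched) / len(left_keys) if left_keys else 0.0,
    }


# Illustrative data: one identifier follows a different naming convention.
df_left = pd.DataFrame({"id": ["AAPL", "MSFT", "GOOG"], "value": [1, 2, 3]})
df_right = pd.DataFrame({"id": ["AAPL", "MSFT US", "GOOG"], "score": [10, 20, 30]})

report = key_overlap(df_left, df_right, "id")
print(report)
```

A match rate below 1.0, together with the `left_only` and `right_only` lists, points to the identifiers that need cleaning before the join is attempted.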