Total number of lines in KEN_ALL.CSV: 123213

Number of distinct postcodes:

sqlite> select count(distinct(postcode)) from postcodes;
119071

Number of duplicate lines = 4142

Number of lines in jigyosyo.csv: 21704

Number of addresses which correspond exactly to addresses in KEN_ALL.CSV: 

Found 19542; not found 2162.

