Philadelphia Open Crime Data on Phoenix / HBase
This is an update to a previous article on accessing Philadelphia Open Crime Data and storing it in Apache Phoenix on HBase.
It seems an update to Spring Boot, Phoenix and Zeppelin make for a cleaner experience.
I also added a way to grab years of historical Policing data.
All NiFi, Zeppelin and Source is here: https://github.com/tspannhw/phillycrime-springboot-phoenix
It seems an update to Spring Boot, Phoenix and Zeppelin make for a cleaner experience.
I also added a way to grab years of historical Policing data.
All NiFi, Zeppelin and Source is here: https://github.com/tspannhw/phillycrime-springboot-phoenix
Part 1: https://community.hortonworks.com/articles/54947/reading-opendata-json-and-storing-into-phoenix-tab.html
Resources
We convert JSON to Phoenix Upserts.
We push JSON Records to HBase with PutHBaseReord.
Query Phoenix at the Command Line, Super Fast SQL
Resources
https://www.opendataphilly.org/dataset/crime-incidents/resource/f6e5998d-9d33-4a45-8397-3a6bb8607d10
Example Data
"dc_dist":"18",
"dc_key":"200918067518",
"dispatch_date":"2009-10-02",
"dispatch_date_time":"2009-10-02T14:24:00.000",
"dispatch_time":"14:24:00",
"hour":"14",
"location_block":"S 38TH ST / MARKETUT ST",
"psa":"3",
"text_general_code":"Other Assaults",
"ucr_general":"800"}
Create a Phoenix Table
Create a Phoenix Table
/usr/hdp/current/phoenix-client/bin/sqlline.py localhost:2181:/hbase-unsecure
CREATE TABLE phillycrime (dc_dist varchar,
dc_key varchar not null primary key,dispatch_date varchar,dispatch_date_time varchar,dispatch_time varchar,hour varchar,location_block varchar,psa varchar,
text_general_code varchar,ucr_general varchar);
Add NiFi / Spring Boot Connectivity to Phoenix
org.apache.phoenix.jdbc.PhoenixDriver
jdbc:phoenix:localhost:2181:/hbase-unsecure
/usr/hdp/3.1/phoenix/phoenix-client.jar
/usr/hdp/3.1/hbase/lib/hbase-client.jar
/etc/hbase/conf/hbase-site.xml
Run spring boot …