Northeastern University, Spring 2018
This week we’re going to be using Carto like we did in Week 2. We’re going to learn some standard types of geospatial analysis. Although the details of the techniques are specific to the Carto software, they are also standard types of analysis that you can do in QGIS or ArcGIS. Once again, the idea is not to give you an expertise over every single kind of geospatial analysis, but to give you hands-on experience with using specific ones. The format for today is going to involve following a few tutorials that Carto provides to its users, but following those same steps using our own data.
For today’s class we’re going to use two Boston municipal datasets, one of which you used in Week 2. Connect to each of these datasets in your Carto account.
The first tutorial is the Intersect Second Layer tutorial on Carto.
We’re going to try and go through it using our own data. The idea is that we want to run a spatial analysis that will count the number of Code Enforcement Violations from 2017 (represented by the ~52,000 points in our cepviolations_2017.csv
file) in different Boston neighborhoods (the Boston neighborhoods polygon layer). This is going to add a column to our Boston neighborhoods layer that counts up all of the points that fall within its boundaries. Try to come up with the following:
For this exercise, we’re going to learn how to group points together using k-means clustering. Using the Code Enforcement violations dataset, we’ll try to group the ~52,000 code violations into six geographical zones. Follow the Clusters of Points Carto tutorial, but substitute your CEP violations dataset instead of store locations: https://carto.com/learn/guides/analysis/calculate-clusters-of-points/.
Finally, we’re going to combine clustering (Exercise II) with a second step of analysis: finding the centroids of our six new “zones” (clusters) of CEP violations. If we were going to assign six city employees to go around issuing violation tickets in these six areas, where should they be stationed? Basically, what is the most central location in each of these areas (“as the crow flies”)? Follow this Carto tutorial, but once again substituting your Boston CEP violations from 2017 rather than New York Citi Bike data: https://carto.com/learn/guides/analysis/find-centroid-of-geometries/
Note: You won’t be using the “weighting” feature of this analysis, since the violations don’t have this kind of quantitative data attached to each individual point. Ignore the steps of WEIGHTED BY
and AGGREGATE
.
Create a final map that includes the following elements: