class: center, middle, inverse background-image: url(https://www.unomaha.edu/university-communications/downloadables/campus-icon-the-o/uno-icon-color.png) background-position: 95% 90% background-size: 10% # Analyzing Content <br> <br> <br> [Justin Nix](https://jnix.netlify.app) *School of Criminology and Criminal Justice* *University of Nebraska Omaha* <br> <br> <br> <br> <br> .white[November 7, 2024] ??? --- class: top # What are Secondary Data? <img src="secondary_data.png" width="60%" style="display: block; margin: auto;" /> ??? Secondary data analysis is simply the act of collecting or analyzing data that were originally collected by someone else at another time. --- class: top # What are Secondary Data? -- Generally, four types: -- <img src="secondary_data_types.png" width="50%" style="display: block; margin: auto;" /> ??? *Official statistics* refers to aggregated data released for public use by, e.g., police, courts, and correctional agencies, or other federal, state, and/or local agencies. - Example: the incident-level reported crime data posted by OPD that you used for your sampling assignment earlier this semester. - Or: [crime maps](http://opdcrimemapping.com/) produced by OPD. *Official records*, meanwhile, refers to the more detailed, raw data that these agencies keep primarily for administrative purposes. These typically contain more granular information but access may be restricted due to privacy concerns, ongoing investigations, etc. - Example: homicide case files, police-involved shooting investigations. --- class: top # What are Secondary Data? -- Check out the University of Michigan's [ICPSR](https://www.icpsr.umich.edu/web/pages/) <img src="icpsr.png" width="50%" style="display: block; margin: auto;" /> ??? ICPSR contains data and information from thousands of studies, including some conducted independently and some by the US government (e.g., BJS). --- class: top # What are Secondary Data? -- Or browse [GitHub](https://github.com/jnixy) <img src="github_jnixy.png" width="50%" style="display: block; margin: auto;" /> --- class: middle, center, inverse # Discussion Question ## Do you think POLICE RESOURCES are allocated equitably by RACIAL/ETHNIC DISTRIBUTION and SOCIOECONOMIC STATUS? --- class: top # What are Secondary Data? ## Case Study [David Thacher (2011): "The Distribution of Police Protection"]((https://link.springer.com/article/10.1007/s10940-010-9125-3) <img src="thacher.png" width="45%" style="display: block; margin: auto;" /> ??? **Method**: matched UCR and Census data from 1970 to 2000. Included a measure of income inequality as a proxy for poverty. So this is a good example of using official statistics as a secondary data source. It's also a sort of historical study of sorts. --- class: top # What are Secondary Data? ## Case Study [David Thacher (2011): "The Distribution of Police Protection"]((https://link.springer.com/article/10.1007/s10940-010-9125-3) <img src="thacher_t1.png" width="80%" style="display: block; margin: auto;" /> ??? **Key finding**: wealthier and lower-crime areas receive more police per crime. Crime became more concentrated in the poorest communities over these 4 decades, and as a result, police resources per crime have become less egalitarian. - "The result is a growing workload disparity between rich and poor jurisdictions. In rich jurisdictions, each police officer has responsibility for fewer and fewer crimes over time, while in poor jurisdictions this part of the police workload has either remained constant or grown." (p. 289) - Note that crime is much rarer in wealthier areas: should police spend less time in those places? - How might this go over with the residents of those areas? - Should that matter? Considerations... - Can lead to **legal cynicism** in impoverished communities - What is the **ETHICAL** responsibility of the police? Should resources be allocated equitably? What does that mean in this context? --- class: top # Historical Events Research <img src="historical_events.png" width="50%" style="display: block; margin: auto;" /> ??? We can improve our understanding of social processes when we are able to make comparisons to other times and places. **Historical Events research** seeks to develop theoretical explanations of historical events and processes rather than offer "fact-centered" descriptions of them. There are no hard rules for determining how far in the past the focus of research must be for it to be considered "historical"; in practice, we generally consider something historical research if it focuses on a period prior to the experience of most of the people conducting the research. -- ***What do you think are some methodological challenges here?*** ??? - Documents and other evidence may have been lost or damaged - What evidence there is may be a biased sample - E.G., toward more newsworthy figures, or people who were more prone to writing - The **feelings** of people involved in past events may be hard/impossible to reconstruct --- class: top # Historical Events Research ## Case Study Ohio State University's [Historical Violence Database](https://cjrc.osu.edu/research/interdisciplinary/hvd) <img src="nyc_homicides_historical.jfif" width="75%" style="display: block; margin: auto;" /> ??? Huge interdisciplinary collaboration that has produced historical violence data for: - NYC - Chicago (1870 - 1930) - LA County (1830 - 1874) - Philadelphia (1839 - 1932) - San Francisco (1849 - 2003) - "the American West" - Asia - Australia and New Zealand - Several countries in Europe As well as information about riots in the US --- class: top # Historical Events Research ## Case Study Eric Rise (1995): [*The Martinsville Seven*](https://www.upress.virginia.edu/title/2797/) <img src="martinsville_seven.png" width="45%" style="display: block; margin: auto;" /> ??? In January 1949, a 32y.o. white woman in Martinsville VA accused seven young black men of violently raping her. - Within 2 days, state and local police had arrested and obtained confessions from each of the seven suspects. - In brief trials held over the course of 11 days, six separate juries convicted the young men and sentenced them to death. - In February 1951, each died by electric chair. Eric Rise examined a ton of historical documents: legal papers that lawyers made available, official transcripts, briefs and petitions, related case law from SCOTUS, annual reports from the VA DoC, and newspaper accounts Rise highlights the efforts of the attorneys who, rather than focusing on procedural errors, directly attacked the discriminatory application of the death penalty. The Martinsville Seven case was the first instance in which statistical evidence was used to prove systematic discrimination against blacks in capital cases, and it was the first time that equal protection arguments were used to challenge the constitutionality of capital punishment. --- class: top # Comparative Methods <img src="world.png" width="50%" style="display: block; margin: auto;" /> ??? Comparative Methods are used by researchers interested in making often transnational connections and comparisons between entities across time and space (historical periods and geographical contexts, respectively) -- <p style="text-align: center;">Can be <b>descriptive</b> or <b>analytic</b></p> ??? Research that seeks to understand the structure, nature, or scope of nations' criminal justice systems or rates of crime is **descriptive.** Research that seeks to understand how national systems work and the factors related to their operations is **analytic.** Doesn't have to be cross-national...this type of research can be used at smaller units of analysis like states and cities. - E.G., Dr. Richards and I published a paper looking at the effects of COVID-19 on victim help-seeking behaviors in 7 jurisdictions. We felt this was more useful than the predominant approach at the time of trying to generate nationally generalizable estimates. --- class: middle, center, inverse # Discussion Question ## Do you think WAR has an effect on a nation's HOMICIDE RATE? --- class: top # Comparative Methods ## Case Study Archer & Gartner (1984): [*Violence and Crime in Cross-National Perspective*](https://www.ojp.gov/pdffiles1/Digitization/105516NCJRS.pdf) <img src="ww2.png" width="43%" style="display: block; margin: auto;" /> ??? They launched the *Comparative Crime Data File*, with extensive crime data from 110 nations and 44 major international cities, spanning 1900 to 1970 War might affect homicide rates through a number of theoretical mechanisms: 1. **Social solidarity model**: wartime will decrease violence because it increases solidarity among citizens. 2. **Violent veteran model**: Postwar levels of violence will increase as a result of the violent acts of returning war veterans. 3. **Legitimization of violence model:** during war, society reverses its prohibitions against killing, instead honoring acts of violence that during any other time would be considered murderous. By temporarily granting social approval of violence, there may be a lasting reduction of people's inhibitions, thereby increasing levels of violence within nations. The authors found that most combatant nations experienced substantial increases in their rates of homicide following both "small" and "large" wars. Key findings from their research include: 1. There is significant variation in homicide rates across countries, which they argue **cannot be explained solely by differences in reporting practices**. 2. Economic factors, such as **income inequality**, have a strong correlation with the rate of violent crimes, particularly homicide. 3. The authors also explore the impact of political systems and the **level of democracy**, finding complex relationships that suggest more nuanced social dynamics influence crime rates **beyond simple economic or political explanations**. --- class: top # Comparative Methods ## Case Study Savage et al. (2008): ["Economic assistance and crime"](https://doi.org/10.1177/1477370807087645) <img src="economic_assistance_crime.png" width="43%" style="display: block; margin: auto;" /> ??? Authors examined the relationship between social welfare spending and homicide in 52 countries. - Does welfare spending encourage dependency and weaken personal initiative? If so, it might encourage crime. - Or does it equalize inequities, which might be expected to reduce crime? when analyzing the **Correlates of Crime** dataset (1960-1984), they found that **social welfare spending was negatively related to theft and homicide**. - In other words, as welfare spending went up, rates of those crimes generally went down. Even after controlling for other important factors like GDP and unemployment. From the abstract: > Our findings suggest that per capita social welfare spending is associated with lower rates of both theft and homicide. Time lagged analysis suggests that the current level of social welfare spending, not that of recent years, accounts for any possible suppression of crime. The data also suggest that, whereas high homicide rates do not appear to inspire increased social welfare generosity, lagged measures of theft rates are associated with subsequent increases in social welfare spending among high theft countries. --- class: top # Content Analysis ??? • Content Analysis involves the systematic, quantitative analysis of message characteristics, with the goal being to develop inferences from text -- ## Identifying a population of documents or other textual sources <img src="file_cabinets.png" width="43%" style="display: block; margin: auto;" /> ??? 1. Identify a population of documents or textual sources for study. 2. Determine the unit of analysis. 3. Select a sample of units from the population. 4. Design coding procedures for the variables to be measured. 5. Test and refine the coding procedures. 6. Base statistical analyses on counting occurrences of particular items. Keep in mind that different coders might perceive different meanings in the same text segments. So explicit coding rules are necessary to maximize consistency. --- class: top # Content Analysis ## Case study Goff et al. (2008): ["Not yet human"](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=c6027ac81ceee887cb3b6bf0bc09f66337cc46ec) <img src="ape_metaphors.png" width="50%" style="display: block; margin: auto;" /> ??? Psychologists at Penn State and Stanford do a series of experiments showing that Americans implicitly associate Black men and apes. - The studies also demonstrate how this cognitive process significantly alter their judgments in criminal justice context, increasing endorsement of violence against Black suspects - Then, in an archival study of actual capital criminal cases, they show that news articles written about Black people convicted of capital crimes are more likely to contain ape-relevant language than news articles written about White convicts - Further, those who are implicitly portrayed as more apelike in those articles are more likely to be executed. **Data**: death-eligible cases in Philadelphia between 1979 and 1999 - 153 cases for which the authors had mugshots and press coverage in *The Philadelphia Inquirer* - They found 788 articles covering these cases. - Each article was "coded for the presence of 54 words that connoted bestial or subhuman qualities" - Words were chosen from a random sampling of 5% of the articles - These words were then shown to 24 naive raters who read each word in its context and were asked to think of an animal that was associated with the target word in each sentence. - 35 words elicited ape, monkey, or gorilla from more than 12 of the raters. - Authors showed these 35 words to a new set of raters, and this time 17/24 answered ape, monkey, or gorilla. - Ultimately, each article was scored in terms of the total number of ape words it used to describe the defendant. --- class: top # Crime Mapping <img src="crime_mapping.png" width="60%" style="display: block; margin: auto;" /> ??? Police departments have been doing "crime mapping" for 100+ years, but that's not what we're talking about today. We're talking about mapping for traditional research purposes (e.g., testing theories of criminal behavior), not mapping for investigative purposes. Crime mapping allows for an advanced form of statistical analysis. --- class: top # Crime Mapping ## Case study Shaw & McKay (1942) [*Juvenile delinquency and urban areas*](https://psycnet.apa.org/record/1943-00271-000) <img src="shaw_mckay.png" width="45%" style="display: block; margin: auto;" /> ??? Shaw & McKay mapped thousands of arrests for juvenile delinquency in Chicago, using police records to determine the names and addresses of those arrested between 1927 and 1935. They found a decrease in rates of delinquency as distance from the city center (Zone 1) increased. - Zone 2 had the highest rates of delinquency "The communities with the highest rates of delinquents are occupied by these segments of the population whose position is most disadvantageous in relation to the distribution of economic, social, and cultural values...these have the fewest facilities for acquiring the economic goods indicative of status and success in our conventional culture" (pp. 318-19). **But keep in mind**: what do arrests as a measure of delinquency tell us? What do they not tell us? --- class: top # Methodological Issues when Using Secondary Data <img src="methodological_issues.png" width="50%" style="display: block; margin: auto;" /> ??? Small number of cases, spotty historical records, variable cross-national (or state) record keeping practices, different cultural and linguistic contexts - All limit the confidence we can havea in measures, samples, and causal conclusions It's on the researcher to use secondary data **responsibly**. You should be able to answer the following questions: 1. What were the goals/purposes of the researcher or agency who initially collected the data? 2. Who collected the data and what were their qualifications? Is their methodology transparent? 3. What data were collected & what were they meant to measure? 4. When were the data collected? 5. How consistent are the data with data made available from other sources? 6. Is their missing data, and if so, how significant of a concern does it appear to be (e.g., is their a meaningful pattern of missingness or is it MCAR)? --- class: top # Methodological Issues when Using Secondary Data ## Measuring across contexts -- <img src="measuring_across_contexts.png" width="77%" style="display: block; margin: auto;" /> ??? The meaning of concepts and the operational definition of variables may change **over time** and **across places**. The values of your measures for particular geographic units like counties, or police jurisdictions, may vary over time simply due to changes in the physical boundaries of these units. This is a map of all known incidents where police officers discharged their firearms with the intent to strike a person from 2014 to 2021, as per the SPOTLITE data compiled by researchers at the University of Illinois. I put it here to (a) show the variation but also (b) point out that it's likely easier to track down shootings in places like NYC or LA, with huge local media markets that often make national headlines, than in rural and remote areas. Also, if you wanted to collect national data on lower levels of police force, you'd run into problems because there is no concensus definition of what "use of force" is, let alone what officers are required to document. Here's another example: the definition of "police violence," as tracked by Mapping Police Violence, has changed over time: [link](https://jnix.netlify.app/post/post25-mpv-updates/) --- class: middle, center, inverse # Have a great day! 😄 ## *Not everything that can be counted counts,* <img src="einstein.png" width="40%" style="display: block; margin: auto;" /> ## *and not everything that counts can be counted.* <!-- ```{css, echo=FALSE} --> <!-- @media print { --> <!-- .has-continuation { --> <!-- display: block; --> <!-- } --> <!-- } --> <!-- ``` --> <style> p.caption { font-size: 0.5em; color: lightgray; } </style>