As I was falling asleep last night I thought up another (dinky) analysis I could run on the Craigslist data.
Question of interest: what state(s) had the highest proportion of Craigslist personal ads compared to their total population? Easy little analysis, I know, but it’s a fun thing to look at.
So here we go!
This map shows a gradient from white to deep purple. States that are whiter have a lower proportion of Craigslist ads with respect to the total state population. States that are a deeper purple, therefore, have a HIGHER proportion of Craigslist ads with respect to the total state population.
The three states with the lowest proportion of Craigslist ads:
West Virginia (0.021%)
The three states with the highest proportion of Craigslist ads:
Washington, D.C. (1.055%)
And yes, I realize Alaska and Hawaii aren’t on this map (and weren’t on the original US of Craigslist post either). Sorry about that. I’d remedy that, but it’s late and I want to go read some fanfic before I go to sleep. Hawaii’s the same color as Nevada; Alaska’s the same color as Illinois.
Sweet Jesus crackers, it’s data time!
Over the past month I’ve been collecting data off of Craigslist; specifically, data from the Craigslist personals. This is mainly because statistics is my crack I like to flex my analytical muscles as often as possible and I’ve been in a data drought for far too long.
Anyway, enough blabbering. ONWARDS!
Data: Craigslist Personals from all 415 individual Craigslist listings* within the United States, divided into the four main categories:
WFW: women seeking women
WFM: women seeking men
MFW: men seeking women
MFM: men seeking men
The data were collected over a 13-day span (March 1 through March 13). I recorded the number of personals under each listing that were posted within the last 15 days of whatever day in the 13-day span on which I was doing the collecting. So keep in mind there might have been some post-Valentine’s Day angst-driven posts for some of the listings.
Results of interest:
1. (via extrapolation) Approximately how many Craigslist personal ads (in the United States) are posted yearly?
2. (via correlation) How highly correlated are the number of ads per state and the population of the state? In other words, do more populous states have more personal ads?
3. (via graphs) Where in the United States do homosexual personal ads (WFW, MFM) outnumber heterosexual personal ads (WFM, MFW), and vice-versa?
4. (via way-too-meticulous-digging-through-individual-listings) Are there any individual listings that can be considered “unnecessary” due to lack of posts? Are there any individual listings that should be further divided due to way too many posts?
Let’s do this!
1. Approximately how many Craigslist personal ads in the United States are posted yearly?
Within 15 days, there were a total of 297,141 personals posted. That’s almost .01% of the US population.
Assuming there’s a fairly uniform number of personals being posted year-round, that would be a total of 7,230,431 ads per year (about 2.3% of the US population). That’s a lot of Criagslistin’.
2. Do more populous states tend to have more personal ads?
This’ll probably end up as a “duh,” but it’s worth checking out. Maybe EVERYONE in Wyoming is posting because they can’t find one another, while everyone in Cali is shying away from personals because they’re so sick of being around people.
Food for thought.
Here’s a quick little graph just to give you the idea of the range we’re talking about here.
State with the fewest number of ads: North Dakota (201 ads in 15 days)
State with the most number of ads: California (46,016 ads in 15 days)
To check if there’s a correlation between state population and number of ads, I ranked the states by the number of ads and also by the population, then ran a Spearman rank correlation on the two rankings (non-parametric statistics FTW).
rsp = .837
That’s a pretty high correlation, I don’t care who you are. So yes, the higher a state’s population, the more ads they are likely to have on Craigslist. Durh.
3. Where in the United States do homosexual personal ads (WFW, MFM) outnumber heterosexual personal ads (WFM, MFW), and vice-versa?
This was an interesting one that didn’t quite turn out as I expected. For one, there were more ads in the MFM section than any of the other three sections for pretty much every single listing. This brought the total of the homosexual ads well above the total of heterosexual ads for most listings. That alone was surprising to me.
What’s even more surprising, though, is the pattern of homosexual- and heterosexual-dominated ads by state. Here’s a map that breaks the states down by the ratio of homosexual ads to heterosexual ads.
In order to make keying this thing easier, I centered the ratios at zero, where zero indicates a ratio of 1:1, negative values indicate a ratio of more than one heterosexual posting for every homosexual posting, and positive values indicate a ratio of more than one homosexual posting for every heterosexual posting. I color-coded the map by creating six intervals on either side of zero, with each interval increasingly more imbalanced (fewer/more homosexual postings per heterosexual posting). Therefore, the more intense the colors get, the more imbalanced that state is in terms of the ratio of homosexual to heterosexual postings. I’m dumb and lost the original ratios, but they ranged from .292:1 (.292 homosexual posts for every heterosexual post; South Dakota) to 3:1 (3 homosexual posts for every heterosexual post; Washington, D.C.). States that have more homosexual ads are a deeper red; states that have more heterosexual ads are a deeper blue. States that have a near 1:1 ratio are white.
Can any of you dudes see any sort of demographic that this pattern follows? I was thinking that maybe the ratios followed the red/blue states, but that doesn’t appear to be the case. I also thought that maybe it would vary by general geographic region, but that doesn’t appear to be the case either (except for the Northwest, which is pretty “neutral” overall). Interesting stuff.
4. Are there any individual listings that can be considered “unnecessary” due to lack of posts? Are there any individual listings that should be further divided due to way too many posts?
This wasn’t as tedious as I thought it’d be…just basically involved going back through the data for the individual listings to see if there were any that had HUGE amounts of ads or any that had virtually none.
Some points of interest:
The average number of ads posted per listing was exactly 716.
Pierre, SD had only three ads posted.
New York City had 23,122.
WFW had the fewest ads overall (7,923 for the whole country), while MFM had the most (191,753).
Cool stuff, eh?
*when I say “listing” I mean things like “Pullman/Moscow” under Washington’s state or “Rockford” under Illinois…all the individual cities/towns/regions. When I say “personal” or “personal ad” I mean things like “Good Man Wanted” under Tippecanoe’s WFW section or “SEXCAPADES” under Boulder’s MFM.
So during a study break this afternoon I took a casual little jaunt over to Craigslist to see what those in Vancouver were most recently ranting about. For whatever reason I decided to scroll down the page rather than click the little “Canada” link at the top, and I noticed that a few of the US states had only one little sublisting beneath them (Wyoming, for example).
This got me to wondering: does the way Craigslist create its state listings reflect the uniformity of those states’ population distributions? In other words, for example, if a state only has cities listed as opposed to large areas like “western Wisconsin,” does that reflect the fact that the state has its population “clustered” into small areas and not uniformly distributed throughout the state?
Now of course you know me and you know how I do things, so this wasn’t going to be some simple analysis in which I would merely count up the listings or something and do a rank ordered map thing.
It has to be more complicated than that.
So without further ado, here’s what I did:
First I decided to take a look at the sublistings and rank them in order of size. It turns out that there are listings that range from as large as the entire state itself down to just regular cities. Here’s what I’ve got:
- Area (e.g., “northwest CT,” “heartland Florida”)
- Pair of cities (e.g., “Moscow/Pullman”)
Theory: the more uniformly a state’s population is “spread out” in the state, the more likely there will be larger area listings for that state (e.g., just the state listed, or just areas and counties). The less uniformly a state’s population, the more likely that there will be a lot of smaller area listings (like a lot of cities and pairs of cities) rather than large area listings.
Of course, there is the overall population to consider—for example, Wyoming just has “Wyoming” listed ‘cause nobody lives there. But there are slightly more populous states that also have just the state listed. Similarly, there are also slightly more populous states that just have a few cities listed, thus indicating that the small populations of these states are clustered into areas and not uniformly spread out.
So here’s how I quantified “uniformness”—I gave every city listed under every state a value of 1. I then gave ever pair of cities, county, area, and state listed values of .8, .6, .4, and .2, respectively. I then, for every state, summed these numbers and divided by the number of listings. This way, the more uniformly a state’s population is spread out, the closer this final number will be to zero (or .2, rather, because that’s the value I assigned to “state”), and the more clustered the population, the closer this number will be to 1.
If I could find some reference to compare this to I would, but I can’t find one, sorry.
“Seeking nerdy, quirky, curvy and dirty”
“SUGAR DADDY LOOKING For one Girl to SPOIL”
“looking for inteligence”
“E-Harmony reject looking for lotsa frisky fun”
“whats wrong with asslicking?”
I like to think this is the same guy as the E-Harmony guy.
“Scintillating discourse and spankings”
“city of dudes”
He refers to the city as “Mancouver”
“Who would win in a drunken fight Count Chocula or Tony the Tiger?”
Not gonna lie, had to resist the urge to email this guy.
“Seeking Totally Dysfunctional Co-Dependent Disaster of a Relationship”
“Origami Enthusiast wants to try Frisbee golf Seeks same”
I bet some lady somewhere’s going “alRIGHT!”
“Tarzan Seeks Jane for Jungle Fun”
“Must Love Chickens”
“I’m smarter than the average bear”
HEEEEEEEEEEEEEEEEEERE COOOOOOOOOOOOOMES YOOOOOOOGIIIIIIIIIIII!
And from someone’s message:
“…doesn’t think the key to finding someone lies in describing precisely how they have to look, like she’s ordering toppings from the “build-a-man” counter at Subway…”
…does the “build-a-man” counter at Subway have Five Dollar Footlongs?
Fun times on Craig’s List. Just a few personals around Moscow.
If you’re a musician, odds are you can probably help me! :-) (Moscow)
Most musicians I know – including myself – partake of a particular green leafy herb that I am in need of, as I just moved to Moscow from Boise. If you can help hook me up, I will share!
Want woman with lots of money – 21 (Moscow)
Are you lonely?
Can you pay my rent for me?
Penis – 23 (Earth)
Sarcastic pessimist seeks same – 22 (Pullman)
Quick and dirty, here we go!
1. Do you like to cook?
2. What do you think of recycling?
3. Are you against all drug use?
4. Where do you see yourself in 10 years?
5. Top five bands/genres?
Oh yeah, tell me why. That’s right, it’s a fucking short answer quiz. Send me your questions.
Let’s find our rainbow! I’m so excited I could vomit.
Girl humping pole on sidewalk – m4w – 21 (AA)
We have compatible qi.
I dont like bars.
You have such a hot ass.
Want to drink at my house?