Category Archives: Blogging

Eh, good enough

I feel bad for not doing anything for my 8 year anniversary. So here’s some crap:

  • Total number of words over the past year (May to May): 99,463
  • Average number of words per sentence: 12.09 (in general, I write like 27-word sentences, but I think they average out with all my one-word survey responses and my filler text such as “YO!” and “Anyway.” and “Weird.”
  • Gunning Fog Index: 8.25 (so you need about 8.25 years of formal education to understand my blathering, apparently)
  • Wordle!

Untitledlll

I don’t know why in the hell I started this blog in May. I guess I wasn’t thinking that this would always be the busiest month of the year.

Anyway. Thanks again for reading, all you loyal followers! And thanks for all your comments, Matt!

Eigenblogger Turns Eight!

That’s right, I’ve been blogging for eight years now.

That’s 2,923 days (counting today).

I was hoping to do something cool for today, but I’m obscenely busy and didn’t manage to plan anything.

But there will be later anniversaries, I’m sure. Number 10 will be big, I promise!

Thanks for reading, y’all, I appreciate it. :D

I don’t have a title!

CRAP I just realized that my 8-year blogging anniversary is happening in like 45 days. What should I do, anything special?

I was actually thinking of doing a week of vlogs, but I’m an ugly buttface and no one wants to see that AND the basement’s always filthy and no one wants to see that AND that’s like the week before finals or something and I’ll be in too much of a panic to be entertaining.

Heck, I’m not even entertaining now and it’s spring break!

BAH.

GIF - Excited

A Rant…But Not Really

So today’s topic has been on my mind for awhile, and since I have nothing else that’s dying to be said today, I felt like I could just talk about it now.

At the beginning of last semester I gave the link to my blog to a newly acquired friend (said friend: I don’t know if you still read this, but if you are, you probably know what I’m going to talk about, haha). They read a few pages and came back to me the next day saying that the whole blog felt rather pointless.

Understandably, this rather quick judgment had me kind of miffed. But then I realized that they probably saw my blog as pointless because they’re just looking at the surface of it.

And at the surface, it is pretty pointless. I mean really, there’s not much else here apart from surveys, crappy art, crappy rants, internet stuff, and me obsessing about school/stats/Leibniz/other such things.

But I’ve kept this blog since 2006. That means that I’ve blogged from my last month in high school up until now. That’s about 7.5 years now. That’s a long freaking time, especially when it comes to keeping up a daily routine. And there are a lot of reasons why I’ve decided to keep the routine of blogging and why this blog may seem, at the surface, “pointless.”

  • I use this blog to get out some of the extra noise in my head. Doing so may take the form of a rant, a reflection, a discussion about school, whatever. Writing it out just helps to clear some of the clutter.
  • I use this blog to post some of my creative things: my writing, my drawings, my “songs.” However crappy they are, I do like to get them out there somewhere.
  • I use this blog to talk about my passions. Not a lot of my friends like the things I like. It’s hard to find someone to sit down with and rave about statistics. It’s hard to find someone to quote Achievement Hunter with (actually, since I don’t know anyone in person who’s a fan of them, I can’t really find anyone to do this with) or who will talk about how wonderful Leibniz is. I talk about my passions here because if I didn’t, I’d never get to talk about them at all.
  • I use this blog to retain my sanity. I had practically no friends in Vancouver, no friends in London, no friends in Tucson, and very few friends now that I’m back in Moscow. I know it sounds cheesy, but just having somewhere where I could get words out into the public (even if that public was the vast, disinterested internet) made those times manageable.
  • I use this blog to record my life. It’s not all of my life by any means, but it’s a decent chunk of it. I can’t count the number of times I’ve been able to use my blog to go back and find the specific date or dates of something important or something school-related. I love having a record of things, and that’s what this blog is.

So yeah. On the surface, all my posts (or at least the vast majority of them) are probably pretty meaningless. But they mean something to me. They’re part of me. And I’m okay with it if you don’t want to read it.

Just please don’t think of it as pointless, because it most certainly is not.

wtf

I don’t know what to say today,
Ha, ha-ha, ha-ha
I didn’t plan a blog today,
Ha, ha-ha, ha-ha.

This morning sucked and blogging was the last thing on my mind
And now it’s nearing dawn and I don’t want to get behind.

So here’s a poem thing for you,
La, la-la, la-la
Typed directly; impromptu,
La, la-la, la-la.

I’m sorry for the drivel, I’ve a bad case of the the blahs
But this is all you’re getting on this Friday night because

I didn’t plan a blog today,
Fa, la-la, la-la
(But that’s something I should always say,
Fa, la-la, la-la.)

ALRIGHTY

Let’s put my 10,000 Days goal into some perspective: I am approximately 25.5 years old. Do you know how many days I’ve been alive?

9,340, counting today.
Nine thousand, three hundred and forty days.

So if I had been blogging from the day I was born, I still wouldn’t have reached my goal yet.
That is like eight levels of psycho.

As it stands, I’ve got about 20 years of blogging to go before I hit 10,000 posts. That is a long, long time, my friends. Think it’ll happen? I hope so!

(Haha, sorry, I don’t have anything exciting to talk about today.)

Fifty Shades of Eigengrau

Why does my butt hurt? I walked the 20-mile round trip to Pullman, I didn’t butt-glide it.

Anyway.

So out of curiosity (and because I’ve got an INFLUX OF FREE TIME that I’m failing to use constructively), I went back and made a list of every Leibniz reference I’ve made on here since February 19, 2008 (the day we covered him in Modern Philosophy).

Because that’s the type of thing I do.

Results:

  • 26 blogs dedicated solely to him.
  • 20 pages’ worth of Leibniz references, not counting the material from the 26 Leibniz-only blogs.

Holy crap, I am obsessed.

Am I sorry?

NOPE.

I also REALLY miss calculus class. Time to MIT it up!

Psst. Hey.

Seven years.

When I started blogging seven years ago, I would never have guessed I would still be doing it in 2013. Never, ever, EVER would have guessed.

Hell, I started it on a whim.

And yet here we are.

Anyway.

I haven’t planned anything ‘cause I’ve been crazy busy and I only remembered that this was my anniversary about an hour ago.

So instead I’d like to thank all my loyal followers, both long-time (MATT!) and newer. I don’t understand why you follow, but I certainly appreciate that you do. Feel free to leave comments. I love comments. I promise I’ll be better about replying to them once this psycho semester is over.

I’d also like to thank Aneel and E’raina who were the two who peer-pressured me into blogging in the first place. Doing so has been very cathartic at times and at other times has just let me get out all the sludge that builds up in my brain to make room for semi-normal thought.

20 more years to go!

10,000 Views!

I hit 10,000 views on Eigenblogger this afternoon.

Shoutout to you readers! Thanks, you guys, for sticking with this blog through all its crapspazzle.

Have some pics/gifs.

Milky Way Rd

Suggestion Bach

GIF - Animals

GIF - Uncle Sam

Converge

Plane

Rain Dance

Winnie the Pooh

2,500

2,500 days ago, I published my very first blog post.

That’s a little less than 7 years ago (6 years, 10 months, 4 days), for those of you needing a different metric.

A lot can happen in 2,500 days.

I, for instance (in roughly chronological order):

  • Graduated high school
  • Got my wisdom teeth pulled
  • Went on 2 cruises to Alaska
  • Marched a Seattle Seahawks halftime show
  • Witnessed the rise of YouTube
  • Witnessed the fall of MySpace
  • Discovered my passion: statistics
  • Got dumped by someone
  • Discovered Leibniz
  • Roomed with one guy I knew fairly well plus three guys I didn’t know at all
  • Dumped someone
  • Had the most genuine relationship I’ve ever had
  • Earned 2 bachelor’s degrees
  • Turned 21
  • Got accepted into grad school twice
  • Survived swine flu
  • Lived in Vancouver, BC
  • Witnessed the rise of Twitter
  • Ran a 10k
  • Went skydiving
  • Earned a master’s degree
  • Quit a PhD program
  • Walked over 1,000 miles in about 11 months
  • Saw Mount Rushmore
  • Lived in London, ON
  • Been (mentally) very, very sick
  • Took a year off from school
  • Medically withdrew from a PhD program
  • Lived in Tucson, AZ
  • Got a job teaching statistics
  • Went back to undergrad
  • Won NaNoWriMo four times
  • Turned 25
  • Blogged every day

And probably a lot of other stuff that I’m not remembering off the top of my head.

But you know what’s really cool? I’m currently only ONE QUARTER of the way to my goal of 10,000 posts.

Think of all the other things that will happen by post 10,000! That’s not going to be until 2033. What will the world be like in 2033?

I’m excited. That seems so far away.

1 Year as eigenblogger.com

So I’ve had my domain name (eigenblogger.com) for a year now. The WordPress map was implemented at about the same time:

sss

Every continent except for Antarctica, bitches!

 

More specific viewing locations from ClustrMap:

sdfsdf

Cool, huh?

I’ve actually got a couple milestones coming up:

10,000 views (since I’ve been on WordPress)

My 2,500th post (a fourth of the way to my goal!): that’s coming up in like 15 days

7-year anniversary: May 1st…it’s awhile until then, but still.

VROOM!

BEWARE THE IDES OF FEBRUARY

Once I stop being so damn busy these will get better, I promise.

But right now I’m spending about 14 hours on campus every day and I’m STILL not getting everything done. So as unfortunate as it is, blogging has fallen to “low priority” status, at least for now.

I’m thinking that sometime in the near future I’m going to a week of video blogs, ‘cause I’ve never done such a thing and I’ve got a fancy-shmancy iPad (birthday present) for recording my inane drabble in video format.

But as for now, I’ve achieved crippling levels of suck in pretty much every aspect of my life, so I’m going to go curl up and try not to do anything too self-destructive.

Update nonsense

So if you’re reading this you’ve probably seen the changes I made with respect to the title and color of my blog. Purple and blue are nice, but rainbows are “me.”

I had some other colors for the lettering of “Eigenblogger” other than the yellow—purple, white, black, blue—but I think yellow works the best.

There might be a few other tweaks in the next few days, so if things are tripping around, don’t freak out.

 

As if anyone visits this thing daily.

I added this

So you may have noticed that I’ve got a new little tab thingy up at the top of my blog.

I’ve decided to make my attempt at 10,000 blog posts a legitimate goal. How interesting would it be to see the progression of a person over the span of approximately 27 years? Even though I’m like the least interesting person on the planet, I think just chronicling about a quarter of a century’s worth of time for anyone would prove an interesting study.

So yeah. Yet another Claudia Goal. Are you ready to put up with my inanity for another 21 years?

SCARY NOTIONS.

You can lead a Claudia to WordPress but you can’t make her update her blog

GOD I’M A BAD PERSON.

I’m really sorry, guys. I just  mass-posted 32 pages worth of blog and probably annoyed the crap out of any subscribers. Totally understand if you want to unsubscribe.

Anyway. School + work + joy of teaching stats kind of took over my life. I’d write my blog for the day and then I’d be like “oh, internet!” and get distracted enough that by the time I realized I hadn’t actually posted yet I’d think “bah, too late, must sleep,” so I’d stick it in a Word document.

Flimsy excuse is flimsy.

I’ll try to be better, I promise!
Will a guy riding a Ritz cracker make up for it?

Woah, flashbacks

I totally forgot I’d started this blog on MySpace until the little neural pathways that had formed over the course of four and a half years caused me to automatically scroll to the “MySpace” bookmark rather than the “Wordpress” bookmark and I was automatically logged into my old account.

Everything’s still there! I look horrible with long hair.

Crazy life.

Big Week o’ Blog Stats Celebration, Day 6: The Best of the Best of the Best, SIR!

It’s the last day of the big statistics marathon. Sad? I am. But I got a few new R projects coming, so you’ll be subject to those shortly.

Anyway. Today is less about stats analyses and more about just general naked-eye trends. What  questions we’re looking at today:
A. What are my most popular blogs by view count on WordPress?
B. What are some of the most popular search terms people have used to arrive at my blog?
C. What are some of the most hilarious search terms people have used to arrive at my blog?
D. Blogs/topics I think are worth sharing that didn’t make my Best Of list up top.

Part A.
I’ve been on WordPress since September 1st, 2010. Since then, my most viewed blogs have been:

  1. (153 views) Scrabble Letter Values and the QWERTY Keyboard
  2. (149 views) Colored Beats!
  3. (58 views) Oh look, PayPal wants me to fill out a survey
  4. (34 views) TWSB: Well, it certainly would make the cartographer’s job easier…
  5. (28 views) TWSB: Weebles Wobble (But They Wouldn’t if They Had Three Legs)
  6. (26 views) Pi vs. e
  7. (19 views) An analysis of statewise uniform population density (according to Craigslist)
  8. (19 views) Claudia’s 365 Days of Music – A Review
  9. (18 views) 5 x 20 seconds of fun

Those may not seem like tremendously large viewing numbers, but considering I’ve got over 2,000 posts and like three people who actually frequent Eigenblogger, 153’s not too bad. Part B explains some of the numbers.

Speaking of which…

Part B.
Top 10 search phrases are:

  1. “colored beats”
  2. “Leibniz porn”
  3. “what one thing could paypal have done to improve your experience with the account limitation process”
  4. le seul mot juste”
  5. eigenblogger”
  6. “scrabble letter breakdown”
  7. “scrabble letter values”
  8. “scrabble letters”
  9. “scrabble letter rank”
  10. “rho rho rho your boat”

Yes, a freakishly large amount of times my blog has been found have been because of somebody (sombodies?) searching for “Leibniz porn.” That is simultaneously awesome and confusing. Does “porn” mean something like “metaphysical texts” in some other language? If not, and at least one person out there is searching for legitimate calculus-oriented, ostentatious wig-wearing, best-of-all-possible-smut Leibniz porn, WHO THE HELL ARE YOU AND WILL YOU BE MY SOUL MATE FOREVER?!

Ahem.

Le Seul Mot Juste was the name of my blog up until like three months ago.

And “rho rho rho?” Who the hell knows. Maybe my intellectually-compatible-perfect-future-boyfriend-husband-thing (hereafter referred to as my ICPFBHT) was trying to make some sort of stats pun as he sat hunched over his computer keyboard in a darkened room, chugging Red Bulls and listening to electronica. Naked. With stacks of Leibniz’ works next to him.

*Sigh…*

Anyway.

Part C.
People have found my blog by searching for rather humorous things such as:

  • “jokes about leibniz cookies”
  • “analysis without anal”
  • “paddled in parachute pants”
  • “yo dawg science”
  • “jokes about godot”
  • “if your a noodle and you know it clap your hands” (yeah, I have no idea, either.)
  • “ who the hell is millard fillmore”
  • “gdrydfjgki”
  • “gavagai turnips”

Part D.
It’s shameless self-promotion time! I was going to make a big ol’ flowchart thing that showed you what blogs to go for depending on your general interests, but I’m lazy and I’m sure none of you readers really care that much, so you get this instead.

Got here via a statistics-related post and/or are interested in random recreational stats parties? Why not check out my blogs under the Statistics category?

Interested in philosophy?

What about science?

Hungry?

(Want to read me bitch about stuff?)

Haha, that’s all I got. So there you go! Six days’ worth of stats for six years’ worth of blogs. I hope to entertain you all for another six years at least.

Thank you for reading! Seriously. I’m not all about acquiring followers, but it is really nice to have regular readers. :)

Big Week o’ Blog Stats Celebration, Day 5: Overall Trends: Word Count and Gunning Fog Index

Today is mega trends day. I’ll be looking at blog-wide stuff like the overall changes in word count and the overall changes in the Gunning Fog Index. Woohoo!

Hypotheses:
A. The Word Count per blog has increased as time has gone on. That is, my blogs today are longer than my blogs when I first started.
B. The GFI per blog has increased as time has gone on.
C. There is no significant correlation between Word Count and the GFI.

Analyses!
I performed a regression (aka a glorified correlation in this case) between Word Count and Blog Number to determine if the number of words per blog has increased as time has gone on. Which indeed it has; predicting Word Count by Blog Number, the regression equation can be written as Word Count = 0.0613*Blog Number. Blog Number predicts a significant proportion of variance in the Word Count variable, F(1,2190) = 14.15, p < 0.001. Here is a plot. The red line is the regression line. As always, click on those bad boy plots to see them more clearly.

Same procedure for GFI vs. Word Count. The GFI, or Gunning Fog Index, remember, is a measurement of the readability of English writing and its values correspond to the number of years of formal education a person must achieve in order to fully understand the written passage. For example, a GFI of 10 suggests that an individual must have completed 10th grade in order to understand the material. To achieve near universal understanding, Wiki recommends that the GFI of a bit of text hover around an eight.

Anyway. The regression equation here is Word Count = 0.0008639(GFI). GFI predicts a significant proportion of variance in the Word Count variable, F(1, 2190) = 51.86, p < 0.0001. Here is another plot with another regression line.

Finally, I tested the correlation between Word Count and GFI. The correlation was -0.0028 but was not significant with t = -.0.1287, p = 0.8976.

Results:
A. Supported! The regression line isn’t very steep, but it’s significant still.
B. Supported! That’s actually a pretty impressive regression line, in my opinion.
C. Supported! There’s practically no correlation at all between the length of my blogs and the level of comprehension. I blame the surveys.

Woo!

Big Week o’ Blog Stats Celebration, Day 4: Common Words and Topics

Yay, I’ve been waiting for this day! Why? ‘Cause I get to use Wordle. I don’t have any hypotheses for today; rather, I have three main questions of interest.

Question A: how do my “commonly used words” change throughout the years?
Question B: are there some words I use more than others in my blog titles?
Question C: looking at my blog in total, what are my most commonly-used words?

Let’s go!

Question A: Using Wordle’s word counts, here’s a table of my top 10 words for each year (note: Worlde can automatically remove “common” words like the, and, a, etc., so I did that). Words consistently highly used across the years are colored.

(Year 1’s “Andy” is because of a short story I posted. Year 4’s “Hate” is because of grad school.)

Question B:
My top 10 words I use in my titles are:

  1. Blog
  2. Oh
  3. Survey
  4. Time
  5. Waiter (from all my “Waiter! There’s a…” titles)
  6. Claudia
  7. Day
  8. Just
  9. God
  10. One

Question C:
Here is a Wordle of my top 100 words spanning all six years!

I would have guessed I’d used the word “blog” a lot more. And my own name less. I use my name in my blog more than “haha” and I’m always dropping “haha”s all over the place! What.

Bonus: here are a few of my common phrases by year. A lot of these are biased because of one blog containing a repeating phrase, but they’re still amusing.

Year 1:

  • “Claudia is”
  • “Airplane airplane airplane airplane”
  • “Who cares about apathy”
  • “ag sci computer lab”
  • “if you had sex”

Year 2:

  • “the socio-adaptive force”
  • “who said hello”
  • “I can be absolutely fine”
  • “go ahead and stir baby” (haha, it took me like twenty minutes to try and figure out why this was a popular phrase; then I remembered it was because of this)
  • “the fact that I”

Year 3:

  • “wifey wifey wifey wifey”
  • “have you ever”
  • “best of all possible” (hahaha, this was the year I discovered Leibniz)
  • “the mad scientist’s life”
  • “the last time you”

Year 4:

  • “I hate this” (yup, grad school time)
  • “your conversational partner has disconnected” (and Omegle time)
  • “approach to environmental ethics”
  • “what do you want”
  • “today’s song”

Year 5:

  • “this week’s science blog”
  • “today’s song”
  • “you have no idea”
  • “for quite some time”
  • “all of a sudden”

Year 6:

  • “what do you think of”
  • “I miss happiness”
  • “what would it be”
  • “the last time you”
  • “sure why not”

Humorous.

Big Week o’ Blog Stats Celebration, Day 3: Titles, Surveys, and Images

It’s day three!

Today we’re looking at three different variables: trends in my Titles, the frequency of blogs involving Surveys, and the frequency of blogs involving Images

To make sense of these variables and the stats surrounding them, I had to code them. As I said in my first blog stats-related post this week, for the Titles variable, titles were coded 0 if they had nothing to do with the blog content whatsoever (e.g., “Do obedient consonants respond to a Q queue cue?”), a 1 if they were directly relevant to the blog content (e.g., “Greek letters as broken down by meanings in Statistics: a subjective and torturous endeavor”), and 2 if they weren’t completely unrelated but one couldn’t guess the blog content from the title (e.g., “ZOMG”). For the Surveys variable, I just coded the blog entry as 0 if it didn’t contain a survey and 1 if it did. Same thing for the Images variable—a 0 if there were no images and a 1 if there were one or more images.

So. Do I have any hypotheses? Of course I do!
A: The majority of my blog titles have nothing to do with the blog content (that is, they’re coded as 0).
B: I’ve posted more Surveys as time has gone on.
C: I’ve posted more Images as time has gone on.
D: Blogs with Images have fewer words than blogs without Images.

Analyses:
Quick initial analysis: a pie chart of titles!

Hahaha, a quarter of my blog titles tell you absolutely nothing about the associated blogs. That’s fantastic.

Now some more serious fun. To determine whether the amount of Surveys I’ve been posting has been increasing with time, I first made a graph that looks like a bar code to get a rough idea of the frequency/spacing of surveys in my blog*. Each black vertical line represents a Survey blog (y-axis runs from 0 to 1 but since Survey is coded as either a 0 or 1, the appearance of a line indicates Survey = 1).

Second, I looked at the correlation between Blog Number (blog 1 was May 1, 2006, blog 2,192 was May 1, 2012) and the presence of a Survey. The way the coding works, a positive correlation would indicate that as time progressed, I had a greater tendency to post a survey-containing blog.

So….

In this case, I did get a positive correlation of rpb = 0.071. This isn’t the usual Pearson r correlation because I’m not comparing two continuous variables; rather, it’s a point biserial correlation to accommodate the dichotomously-coded Survey variable. However, it’s mathematically equivalent to the Pearson r, so I felt comfortable running a test of significance on the correlation. Turns out, the little .071 correlation is statistically significant, t = 3.346, p < 0.001. This means that the true correlation between Blog Number and the number of surveys I post is not zero and I’ve been posting more and more surveys as time has gone on.

Taking the same procedure with the Blog Number variable and the dichotomous Image variable, here’s another bar code-esque pic (black lines = blogs containing 1+ image):

Here we get an even stronger correlation of rpb = 0.194, which is statistically significant, t = 9.273, p < 0.0001. This shows that the true correlation between Blog Number and the number of Images my blog contains is not zero, and I’ve been posting more and more blogs containing an image as time has gone on.

Finally, I checked out word count between all blogs with Images and all blogs without Images. I made two subset data sets, one containing all the blogs with images, one containing all the blogs with no images, and ran a t-test. The difference in word count was (to me) surprisingly large and definitely significant, t = 6.658, p < 0.0001. The actual means of the No Image vs. Image blogs were 290.425 words and 177.925 words, respectively.

Now results:
Hypothesis A: Haha, totally not supported, and actually opposite: most of my Titles ARE directly relevant to the content. That’s…surprising to me. I name my blogs right before posting them (which is usually like a decade and a half after I write them, given how often I update this blog), and I’ve usually used the “mash the keyboard until the letters make sense” approach to titles. That, or “let’s see what dumb pun I can make today!”
Hypothesis B: Supported! This is probably strongly due to the fact that I’m working to complete the 5,000 Question Survey and have been working on it since late 2010.
Hypothesis C: Supported! WordPress makes it substantially easier to include images than MySpace ever did. Also, more time spent on the internet now = more random humorous images found via StumbleUpon/Tumblr/other blogs/etc.
Hypothesis D: Very supported. The actual word count difference between blogs with and blogs without Images was surprising to me, though the sample size difference could probably be considered a culprit. However, I guess it shouldn’t be too surprised, though; going through the archives I found quite a few blogs that were like “here’s an image!”, the image, and nothing else.

*Yeah, I know there’s got to be a more sophisticated way to represent this. Creating a CDF doesn’t work with a dichotomous variable. Maybe if I write a loop that adds all the preceding 1’s to each instance of a 1 it hits as it goes from Blog Number = 1 to Blog Number = 2193, and then create sort of a pseudo-CDF using that…hmm…next week’s project!!!

Big Week o’ Blog Stats Celebration, Day 2: Mean Word Comparisons by Weekday, Month and Year

Yo, blogland! Time for another round of “stats no one cares about except me!”

Today we’re looking at Word Count by Day of the Week, Month, and Year. I’d like to see if there are any general trends or if I blather on about nothing in relatively consistent bursts across time. Maybe if all these days of analyses reveal some trends, I could try fitting a model to this data. I loves me some model fittin’.

Onwards and upwards!

Hypotheses:
A. No one day of the week will have a statistically significant difference in word count than any other day of the week. I don’t think I blog more or less over the weekend, and I see no reason why any day of the five-day week would have longer blogs than any other.
B. I don’t know if they’ll be significant or not, but I’m predicting that word counts will in general be higher during the spring school months (January– April at least) than the summer/winter months. The more responsibilities I have, the more I turn to blogging for procrastination, and I usually take more credits in the summer.
C. From highest word count to lowest: Year 6, Year 2, Year 5, Year 4, Year 1, Year 3.

Ready? Analyses:
Here is a pie chart (a tasty, tasty pie chart!) of the percentage of words I’ve written by the day of the week.

Pretty equal, eh? But what does the ANOVA say?  According to the stats, there are no statistically significant differences in word count by day of the week, F = 0.642, p = 0.697. According to the Tukey HSDs, none of the individual pairs of days of the week are statistically significant in terms of their word count, either.

Here is another pie chart. This one shows percentage of words by month.

Again, pretty even. Stats? F = 1.505, p = 0.123, meaning that there are no statistically significant differences in word count by month. No statistically significant differences in any of the pairs of months, either.

Finally, we jump to the largest span of time I’m looking at: years! Pie pie pie pie pie:

Haha, holy crap, Year 6 and Year 2 combined account for nearly half of the words in my total blog. Poor little Year 3.

And finally we see some significance! There is a statistically significant difference in word count by blog year , F = 11.021, p > 0.001.

Results:
Hypothesis A: Supported! All days of the week are subject to equal amounts of my blathering. Poor things.
Hypothesis B: Eh. Technically January, February, March, April, and May are the wordiest months, but they’re not significantly so.
Hypothesis C: Woo! I totally called it. If anyone’s curious, Year 3 was a word drought because I was living in the house with the guys and I had…other stuff occupying my time.

More to come tomorrow, ladies and gents!

Big Week o’ Blog Stats Celebration, Day 1: Mean Word Comparisons by Category

STATS TIME! Are you excited?

First, I want to preface all of this with the list of variables I kept track of when going through my blog archive:

  • Blog Number. My first blog is coded as 1, the second as 2, the third is 3, and so on up until 2193.
  • Year. Which blogging year the blog came from. There are six years, each spanning May – May.
  • Month. January, February, etc.
  • Day. The 1st of the month, 2nd of the month, etc.
  • Weekday. Monday, Tuesday, etc.
  • Word Count. Word count of each post, not counting the title.
  • GFI. Gunning Fog Index.
  • Punctuation. How many punctuation marks the post contained.
  • Title. 0 = title unrelated to blog content, 1 = title directly relevant to blog content, and 2 = ambiguous title; could be related or unrlated.
  • Survey. 0 = blog does not contain a survey, 1 = blog contains survey.
  • Image. 0 = blog does not contain any images, 1 = blog contains 1+ image(s)
  • Category. What category did I tag my blog as (details below).

ALSO NOTE: significance is always judged at the p = 0.05 level. Just didn’t want to have to keep specifying that. :)

So! Today we’re looking at Categories. There are 35 of them (or there will be once I go through and delete all the old “defunct” tags from the few blogs that still have them). Here’s the list in case anybody gives a crap:

So what are we looking at within this sexy, large dataset with respect to categories, then?

Questions of Interest
A)
What is the distribution of the categories? That is, which categories are most popular and which are hardly ever used?
B) Do certain categories have a statistically significant different amount of words per post than the other categories?

Hypotheses
A: The most popular categories (by percent) will be Blogging, School, and probably Surveys.
B: The least popular categories will be Ramblings and Sports.
C: Categories with a significantly different number of words per post will be Surveys, Philosophy, and Rants.
D: The three categories specified in Hypothesis C will have higher word counts, not lower.

LET’S DO THIS NOISE.

Analyses
First up, a pie chart! This was my first attempt at visualizing category percentages. By the way, I definitely would have titled this like a good little statistician, but I couldn’t get the image large enough (in my opinion) with the title included. So I’ll call it Percent of Blogs by Category (NOT percent of words by category; that’s just in the ANOVA below).

I had to screw around with this a lot to get it in the easiest to read color scheme. Pie chart with 35 slices = not the best visual, but I think it’s still better than a bar graph in this case.

Table o’ actual counts (click to blow it up so it’s actually readable, haha):

God, all those Blogging blogs.

Second: ANOVAs! Well, okay, just one. But it’s an ANOVA!

According to a more in-depth, ANOVA-driven analysis…

  • The mean Word Count per blog is statistically significantly different depending on blog Category, F = 23.184, p < 0.001.
  • Blogs in the Surveys category have a significantly higher word count than the other categories, t = 7.739, p < 0.0001.
  • Blogs in the Writing category have a significantly higher word count than the other categories, t = 3.624, p < 0.001.
  • Blogs in the Philosophy category have a significantly higher word count than the other categories, t = 3.365, p < 0.001.
  • Blogs in the Rants category have a significantly higher word count than the other categories, t = 2.480, p < 0.05.

I (or R, rather) also computed a buttload of Tukey HSDs (595 of them!) to test the mean differences between each pair of categories, but most of the significant ones involved (as expected) Surveys, Writing, Philosophy, and Rants.

So. Results:
Hypothesis A: supported! Blogging and school, man: my life.
Hypothesis B: mostly supported! There were a few categories that had nearly as few entries as Sports. I’d get rid of the Rambling category, but then I’d have 34 categories, which isn’t a nicely-dividable number like 35 (I like numbers ending in 0 or 5).  Guess I just need to ramble more.
Hypothesis C: mostly supported! I’d totally forgotten about Writing.
Hypothesis D: supported! Surveys, Writing, Philosophy, and Rants contained blogs that had higher than average word counts.

Cool, huh?

Tune in tomorrow for more stats no one cares about except me!

Blog #2,193: Six Years

My blog from May 1, 2006:

“Alrightythen! I finally got a MySpace*. People will now find it easier to stalk me. Anyways, since I’m obsessive-compulsive, I had to start this on the 1st of a Month–I wanted to start this a week or so ago, but NO–that would be in the middle of April. I’m tired now and I want to go to bed, but NO–that would mean I would have to wait another month to start.
Anyway, it’s all good. I’ll be here posting my thoughts every day. At least, that’s what I want to do.

The odds of it happening?

50:1.
Meh.”

Oh, silly little high-schooler. If only you knew.

Yes, the internet has been putting up with my daily drivel for six years now. Scary stuff, huh? Haha, I should actually blame Aneel and E’raina for this…they’re the ones that peer-pressured me into getting a MySpace and starting a blog. DO YOU SEE WHAT YOU’VE DONE?!

Anyway, yay! I’ve spent the past two weeks or so going through each and every one of my posts to create a nice large data set. Because I spent so long doing it, I thought it only fitting that I not confine my statistical exploration of my blogs to one night (tonight). Therefore, I will give you six days (one for each year I’ve been blogging) of stats/blogging insanity guaranteed to either interest you or make you wonder why the hell you even read this. Or both. Or neither. Maybe it’ll give you cheese. The possibilities? Endless.

This is what’s going down in the Big Week o’ Blog Stats Celebration:

  • Wednesday: Comparisons by blog category
  • Thursday: Mean word comparisons by weekday, month, and year
  • Friday: The use and frequency of images and surveys
  • Saturday: Common words and topics
  • Sunday: Overall trends: Gunning Fog Index and word count
  • Monday: The best blogs

Excited? I am! Any excuse to do some recreational stats.

Here’s to six more years (at least)!

*YES, I started out on MySpace. Give me a break, it was the thing back in 2006, you know it was.

It’s my WordPress and I’ll post when I want to

Blahasfhsalrghs sorry guys.

Actually, I should be more specific, huh?

1. Sorry for getting around to posting these like every 25 days or so, and
2. Sorry for the extremely erratic posting shenanigans of the past two days.

With regards to #1:
I know I’ve been totally slacking when it comes to getting these posts up, so I want to apologize for making you all wait (‘cause I know you totally wake up in the morning thinking, “I wonder if Claudia’s blog is updated? Oh sweet Jesus, let her blog be updated!”) and also for inundating those of you who’ve subscribed with dozens of  “Claudia has posted a new blog!” alerts all at once.

With regards to #2:
This one wasn’t entirely my fault (or so I’m going to claim). Yesterday I posted a whole slog of blogs within like 30 seconds because I had them all saved as drafts and I just had to press one button to get them published. Anyway, I think WordPress saw it as some sort of spam blogging account and put a block on my posting abilities. I contacted them and it got resolved this morning. That’s why you saw like 10 blogs yesterday and the rest of them today.

Also, you guys are super quick to comment (TOTALLY NOT A COMPLAINT, I love you all). But it’s 10 PM and I just got home from work, so it may be a bit before I acknowledge your comments, haha.

Anyway.

A new blog year is coming up for me TOMORROW, so I’ll make it one of my “New Year’s” resolutions to try and get these things posted within two or three days of their creation. I know it’s not a big effort to copy/paste/press “post” but I’m a lazy butt. And I’ll update my “About” page finally.

Apologies again, and hopefully this won’t happen anymore.

 

 

(Haha, I can dream, right?)

It’s that time again…

HI FOOLS!

So in a little less than a month I’ll be hitting yet another blogging  milestone: SIX YEARS!

While years four and five were full of fun stats, pics, analyses, Wordles and the like, I think year six will have even more.

I’m going to go through each and every blog so that I can have data from every single day (not just every month, like I had done for last year). There are a few things I’d like to examine:

1. Do I have a statistically higher wordcount on any particular day(s)?
2.
Using the Gunning Fog Index as a gauge, how does my blog readability change over the course of the last six years?
3.
Have I been adding more images to my blog posts overall as time has progressed?
4.
Do my blogs change in terms of length/complexity/readability based on the time of the year?
5.
And a bunch of stuff I can’t remember now.

WOO DATA!