Once Upon a Beta: My NSSA Experience
I had the privilege of being one of the beta testers for NextStage Sentiment Analysis (NSSA) over recent weeks, and this is a total rewrite of my original report of my experience. Doing sentiment analysis on your own writing can be quite a revelation – it can definitely put you in your place. First of all, let me tell you a couple short things about me. I have worked in web analytics for about ten years. I view NextStage’s work as a kind of brave new world (that has such people in it!) that – even with careful and patient guidance – feels just beyond the edge of what I can comprehend. I will tell you honestly that, unlike (probably) other beta testers, I have not experienced a need for sentiment analysis in my job. What I am saying is that my testing approach and my desire to be involved came simply out of analytical curiosity. I didn’t come with a pile of case studies, or anything like that. I am going to do my best here to tell you about my experience. I invite you to ask me any questions that you think I may be able to help with, and I understand if you’d feel more comfortable ignoring this kind of layman-style review.
I am going to start by telling you why this is a total rewrite, even though you’ve probably guessed. I ran sentiment analysis on my first draft and it flagged high for some things I felt were fairly negative, and indicated I would be doing any readers a disservice. I will start with what was not surprising, and explain it to you. I flagged very low for confidence (the scale originally conceived as the BS meter). This is not surprising – as I described above, I am no expert on sentiment analysis (by a LONG shot) and I don’t think I spent enough time being clear about that in my original draft. And despite Joseph Carrabis’s efforts, I always feel a bit of an interloper here in this world of neuroscientific analysis. However, I grasp what my scientific value to the beta effort is: analyst. So I hereby am attempting to provide you with analysis, which should go better. I am sure my confidence will still be low, because that’s my nature, but it shouldn’t be quite so abysmal.
What surprised (and cowed) me was that I also scored very highly for “Retribution” and “Troll”. I see why I scored high on Troll. My tone in the first write was flippant, in a failed effort to project a bit of confidence. I made a few jokes inside, mostly of the variety that would only be funny to me, referencing Dr Seuss and throwing in statements informing you that you could ignore that, etc. Retribution really knocked me over though, and I’ve spent a few days pondering it (without reaching out for help, just as a test). I’ve decided that it was two factors: because of the research I did for my review and my references back to web analytics (which were largely flippant and probably didn’t help my Troll score either). But let me talk about that separately. I hope you’ll stick with me here!
If you’ve read the previous postings on sentiment analysis (I put a list of the ones I found most useful at the end of this posting), you know that in NextStage’s view you need multiple dimensions of data to determine sentiment truly. In preparation for writing this review article, I read a few articles on sentiment analysis. I didn’t do this in a particularly scientific way, I just sort of “turned up” my sensitivity to the term in my regular reading. I took this approach because I wanted to get a feel for what my peers (web analysts) thought sentiment analysis was. Or where it was failing, what was needed, etc. Every article I read talked about scoring a statement (usually a Tweet) as positive, negative, or neutral. Sometimes neutral was omitted. Because I did this “research” after doing my beta testing, you can imagine I had some preconceived notions about why this was inadequate. I likened these people to those that use web analytics only for Hits! (pretty cruel, really) Now, you (or heck, even Joseph) may argue that that can’t cause the retribution flag. That was the best I could come up with after pondering.
Which brings me to a very key point: I had the benefit of Joseph’s help through the beta process. The NSSA results (in current form – I understand plans are to work on this) are really not information that the average analyst can walk up and interpret. I don’t tell you this to trouble you; I am just telling you that the data is nuanced. My first beta interpretation was wrong in so many ways that it could easily fill another post. And, even though I’ve described reports and values individually, they are actually interdependent. For instance, not only was my confidence low, but I scored low on trust and affinity. That’s because I have a very limited idea of who follows Joseph – I know you’re not all web analysts. There’s a good chance some of you think web analytics is total bunk pseudo-analysis! I do not expect that number to improve on this review. I am struggling to give you a clear picture of how the various values interweave, but they truly do, which is a key point. Just like you cannot rely on a single isolated metric or KPI in web analytics, you cannot rely on a single metric in sentiment analysis.
So, to summarize before this gets so long as to be unreadable: the analysis is eerily accurate and eye-opening to say the least. If you write or read, you will probably find yourself in need or at least in want of this tool at some point in your life. I find no evidence out there that there is a comparable tool at your disposal, so when you find yourself needing to know author sentiment (including your own sentiment) you will come back to NSSA.
Thank you for reading,
Jen
Twitter: @jdaysy
Skype: cmjenday
P.S. As promised, the four postings that I leaned heavily on during the beta:
- http://www.bizmediascience.com/2009/06/sentiment_analysis_anyone_part.html
- http://www.bizmediascience.com/2009/06/canoeing_with_stephane_sentime.html
- http://triquatrotritecale.hungrypeasant.com/?p=25
- http://triquatrotritecale.hungrypeasant.com/?p=40




Howdy and our thanks for your kudos, Jen.
One correction I would offer is to your “However, I grasp what my scientific value to the beta effort is: analyst.”
You were invited to take part because we trusted you to document your experience truthfully even if your experience showed us unfavorably. It’s always rewarding when people think well of you, it’s better when people speak their truth of you. Finding people who will speak truth regardless of outcome is a guiding principle in NextStage’s selection of people to help us conduct our research, contribute to whitepapers, perform tests and take part in betas.
And we’re so relieved you liked it (my inherent lack of confidence coming through again).
Thanks,
Joseph
lol, okay – fair enough! Continuing on the vein of documentation, I read the new descriptive material that was added to NSSA this morning, and it is helpful, but I still don’t understand why I score so high on Retribution, and it bugs me GREATLY. I verified the dictionary definition before I read yours and I was hopeful that there was perhaps a broader meaning (like that I was seeking some reward – in this case, feedback and pats on the head from readers and you), but I can’t imagine who or what or whom or whatever I’d be seeking to “punish” or get back at! And if it is that deeply subconscious, I am troubled that an author has the potential to be judged by it. Can you help me understand this better?
I’ve also decided I am no longer troubled by my high Troll score. I suspect everything I write would score high on Troll – I wouldn’t do things like this if they didn’t entertain me-me-me-me. I’m egotistical like that. I wish I still had some of my old requirement elicitation writing … I bet even they would score high for Troll!
-Jen
Howdy,
Troll…yes…the first audience any author must please is themself.
one possibility is that the Retribution filter is turned up too high (an Occam’s Razor solution). I’d want to see lots of material going through before I came to that conclusion, though.
Another possibility (one more likely in general, me thinks) is that “Retribution” can also be a variant of “wanting to view from a distance” or “wanting to keep things at a distance”. I would need to see the whole report to determine if these were the more likely causes for a high Retribution value.
Hope this helps – Joseph
helpful as always, thanks!
I read about NSSA some days ago in another blog and the main things that you mention here are important. How did you get to be a beta?
JEN!!! How in the world did I miss your NSSA beta test post!?! (Well, maybe not knowing you at the beginning of February contributed… but still!).
Just a tad (read as: enormously) jealous you got to beta!
Hi Milford, no big secret method to being in the beta, I just asked. I follow @JosephCarrabis on Twitter and saw him mentioning beta testing and for some reason he said it was okay for me to come onboard!
lol, Adam – make sure to ask to beta the next Wonderment!
This blog really rocks! :)
Tezza,
Thanks for reading and commenting, and on behalf of Jen, thanks.
Joseph