On Aug 31, 2016, at 4:35 PM, Sebastian Benthall
<sbenthall(a)gmail.com> wrote:
I actually love this damn good idea, Nick. Great find. Let's do something like this
in version 0.3.
On Wed, Aug 31, 2016 at 3:16 PM, Nick Doty <npdoty(a)ischool.berkeley.edu> wrote:
I love this paper on communication styles in the Linux kernel:
http://www.opensym.org/os2016/proceedings-files/p101-schneider.pdf
The authors train a classifier to distinguish two particular authors, both leaders
in the Linux development community, based on their lexical choices. The words
"sorry", "thanks", "actually", "never" and expletives are the most
discriminating.
It makes me wonder whether this would also be an interesting characteristic of one
mailing list compared to another, in addition to distinguishing individual authors.
"Where does your open source community fall on the Actually-Thanks Spectrum
(TM)?"
I'd love to see this as part of BigBang, particularly if that kind of lexical
analysis or Bayes classification would be useful for lots of research questions.
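
A minimal sketch of that kind of lexical Bayes classification, in Python with only
the standard library. This is not the paper's method or an existing BigBang API: the
function names, the marker list (expletives left out) and the two toy "lists" are
all assumptions invented for illustration.

```python
import math
import re
from collections import Counter

# Marker words named in the message above (expletives omitted in this sketch).
MARKERS = ["sorry", "thanks", "actually", "never"]


def marker_counts(text):
    """Count how often each marker word occurs in one message."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t in MARKERS)
    return [counts[m] for m in MARKERS]


def train(messages_by_label, alpha=1.0):
    """Fit a multinomial naive Bayes model with Laplace smoothing (alpha)."""
    total_msgs = sum(len(msgs) for msgs in messages_by_label.values())
    model = {}
    for label, msgs in messages_by_label.items():
        # Sum marker counts over every message carrying this label.
        totals = [0] * len(MARKERS)
        for msg in msgs:
            for i, c in enumerate(marker_counts(msg)):
                totals[i] += c
        denom = sum(totals) + alpha * len(MARKERS)
        model[label] = {
            "log_prior": math.log(len(msgs) / total_msgs),
            "log_likelihood": [math.log((t + alpha) / denom) for t in totals],
        }
    return model


def classify(model, text):
    """Return the label with the highest posterior log-score for the text."""
    counts = marker_counts(text)
    scores = {}
    for label, params in model.items():
        evidence = sum(c * ll for c, ll in zip(counts, params["log_likelihood"]))
        scores[label] = params["log_prior"] + evidence
    return max(scores, key=scores.get)


# Hypothetical toy data: two invented "lists", not real mailing list messages.
TOY_MESSAGES = {
    "list_a": [
        "Thanks for the patch, sorry for the delay.",
        "Thanks, applied.",
    ],
    "list_b": [
        "Actually, this never worked.",
        "That is actually wrong and never compiled.",
    ],
}

model = train(TOY_MESSAGES)
print(classify(model, "Sorry, and thanks again"))
```

The labels here stand for mailing lists, but the same code would work with author
names as labels. A real analysis would need far more messages per label and
probably a larger vocabulary than four marker words.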
Thanks,
Nick
_______________________________________________
BigBang-dev mailing list
BigBang-dev(a)lists.sudoroom.org
https://sudoroom.org/lists/listinfo/bigbang-dev