I love this paper on communication styles in the Linux kernel:
http://www.opensym.org/os2016/proceedings-files/p101- schneider.pdf
Training a classifier to distinguish two particular authors, leaders in the Linux development community, based on lexical choices. Use of "sorry", "thanks", "actually", "never" and expletives are most discriminating.
It makes me wonder whether this would also be an interesting characteristic of one mailing list compared to another, in addition to distinguish individual authors. "Where does your open source community fall on the Actually-Thanks Spectrum (TM)?"
I'd love to see this as part of BigBang, particularly if that kind of lexical analysis or Bayes classification would be useful for lots of research questions.
Thanks,
Nick
_______________________________________________
BigBang-dev mailing list
BigBang-dev@lists.sudoroom.org
https://sudoroom.org/lists/listinfo/bigbang-dev