Hi, I’m Naitian.

That’s pro­nounced naɪtjen like 🌙 💴.

I’m a PhD stu­dent at the UC Berkeley School of Information, where I’m ad­vised by David Bamman and sup­ported by the NSF grad­u­ate re­search fel­low­ship.

My re­search in­volves ap­ply­ing large-scale data analy­sis and deep learn­ing meth­ods to study­ing cul­ture, of­ten through a vari­a­tion­ist so­ci­olin­guis­tic an­a­lytic frame­work. This spans the fields of NLP, com­pu­ta­tional so­cial sci­ence, and cul­tural an­a­lyt­ics. I also care a lot about the news, data jour­nal­ism, data vi­su­al­iza­tion and cross­word puz­zles.

If you are in­ter­ested in do­ing re­search or grad school, I am al­ways happy to chat about my ex­pe­ri­ences. If you are a Berkeley un­der­grad in­ter­ested in do­ing cul­tural an­a­lyt­ics re­search, you should ap­ply for a UROP opportunity with my ad­vi­sor, David Bamman.

A collection of photos of my face.

One pic­ture is hard to iden­tify a per­son with” ~ David Fouhey

Updates

Publications

Social Meme-ing: Measuring Linguistic Variation in Memes

Much work in the space of NLP has used com­pu­ta­tional meth­ods to ex­plore so­ci­olin­guis­tic vari­a­tion in text. In this pa­per, we ar­gue that memes, as mul­ti­modal forms of lan­guage com­prised of vi­sual tem­plates and text, also ex­hibit mean­ing­ful so­cial vari­a­tion.
[Website] [PDF]

Condolences and em­pa­thy in on­line com­mu­ni­ties

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
In times of dis­tress, we fre­quently go on­line to seek so­cial sup­port and con­do­lence. But ef­fec­tively pro­vid­ing that sup­port to oth­ers is eas­ier said than done. This study aims to com­pu­ta­tion­ally iden­tify mech­a­nisms and strate­gies for de­liv­er­ing ef­fec­tive and im­pact­ful con­do­lence on so­cial me­dia.
[Website] [PDF]

Code

I am writ­ing or have writ­ten code for: The Michigan Daily as the man­ag­ing on­line ed­i­tor, NBC News as a Data Graphics in­tern, the Michigan Data Science Team as a pro­ject leader, Capital One as a soft­ware en­gi­neer­ing in­tern (x2 sum­mers) and, of course, my­self as naitian.

Here is a sam­pler of my work.

The Spalling Bie
The Spalling Bie is just like the New York Times Spelling Bee, ex­cept you only get points for fake words that sound plau­si­ble. [Link]
Where are the vac­cine deserts?
I did re­search, data col­lec­tion, analy­sis and graph­ics for an NBC News story track­ing where Americans could ex­pect to find phar­ma­cies that would carry the Covid-19 vac­cine. [Link]
Cover Story
As part of a chal­lenge, I used a data­base of book ti­tles from Amazon and a part-of-speech tag­ger to con­struct gram­mat­i­cally cor­rect sen­tences us­ing only book ti­tles. Earned an hon­or­able men­tion from Randall Munroe, cre­ator of XKCD. [Link]
If you start play­ing…
A dif­fer­ent kind of New Year’s count­down, IYSP takes in­spi­ra­tion from the in­ter­net memes about starting the year off right.” [Link]