INDEX
Explanations
interviews or conversations
phrases indicating interviews or discussions with specific subjects
New Auto-Interp
Negative Logits
ILCS
-0.89
battle
-0.76
acci
-0.70
gra
-0.70
thereof
-0.68
years
-0.67
.''.
-0.66
separ
-0.65
fuck
-0.65
fights
-0.64
POSITIVE LOGITS
reporters
0.98
Oprah
0.94
filmmaker
0.94
Vanity
0.93
Rolling
0.90
Conan
0.90
CNBC
0.89
journalist
0.89
Wired
0.88
interviewer
0.88
Activations Density 0.074%