Explanations
references to the word "speeches"
references to paid speeches
Negative Logits
Token          Logit
runs           -0.73
appropriate    -0.71
Wonderland     -0.68
Nation         -0.68
Tier           -0.67
Captain        -0.66
Kay            -0.63
avery          -0.62
abol           -0.62
Grimm          -0.61
Positive Logits
Token          Logit
speeches        1.52
lectures        0.87
esta            0.85
concerts        0.83
clinton         0.81
vows            0.78
seminars        0.78
memos           0.78
announcements   0.78
pitches         0.77
Activations Density 0.014%