INDEX
Explanations
mentions of comments and sell-related phrases
New Auto-Interp
Head Attr Weights
0:0.10
1:0.04
2:0.11
3:0.04
4:0.06
5:0.04
6:0.19
7:0.03
8:0.04
9:0.24
10:0.02
11:0.02
Negative Logits
semin
-4.16
CoC
-3.98
Charlottesville
-3.63
McAuliffe
-3.50
bapt
-3.49
hist
-3.48
Albania
-3.46
Marina
-3.46
doms
-3.45
Alban
-3.43
POSITIVE LOGITS
Smart
8.91
Smart
8.66
smart
7.33
smart
7.25
smartest
5.52
smarter
5.34
Intelligent
4.75
intelligent
4.35
IQ
4.35
elligent
4.19
Activations Density 0.016%