Explanations
spokespersons mentioned in various contexts
references to spokespeople and their statements
Negative Logits
Token        Logit
ech          -0.78
1982         -0.67
spir         -0.66
ãĤª          -0.65
Carbuncle    -0.64
avery        -0.63
pired        -0.63
awed         -0.63
abol         -0.62
ever         -0.62
Positive Logits
Token          Logit
spokesman      1.14
spokeswoman    1.00
spokesperson   0.96
arten          0.81
Debor          0.80
spokes         0.74
glim           0.72
answ           0.72
staffer        0.72
orically       0.72
Activations Density: 0.013%