INDEX
Explanations
references to news reporters or media personnel
Negative Logits
osate   -0.69
ez      -0.67
UTH     -0.66
ston    -0.66
inho    -0.65
tein    -0.65
UCT     -0.63
phal    -0.63
tail    -0.62
uras    -0.61
Positive Logits
gathered    0.90
hips        0.89
afterward   0.88
aboard      0.85
reporters   0.82
stationed   0.82
assembled   0.79
Thursday    0.76
hip         0.76
afterwards  0.75
Activations Density: 0.015%