INDEX
Explanations
words related to commuting or official military positions
terms related to community engagement and communication
New Auto-Interp
Negative Logits
other
-0.66
Sloven
-0.66
nerv
-0.64
unders
-0.63
Euro
-0.61
Feldman
-0.61
Finnish
-0.61
under
-0.60
Painter
-0.60
Eisen
-0.60
POSITIVE LOGITS
rals
0.80
ruary
0.79
rencies
0.78
iton
0.76
issance
0.73
urations
0.73
uted
0.73
rocal
0.70
berus
0.69
urate
0.69
Activations Density 0.030%