INDEX
Explanations
references to military service and personnel
New Auto-Interp
Negative Logits
âϦ
-0.65
PI
-0.64
etting
-0.60
anymore
-0.60
bie
-0.58
hack
-0.57
orph
-0.56
owe
-0.56
bery
-0.56
VR
-0.56
POSITIVE LOGITS
been
1.29
undergone
1.14
begun
1.12
previously
1.12
iths
1.05
gone
1.02
flown
1.01
originally
1.00
gotten
0.99
been
0.99
Activations Density 0.143%