INDEX
Explanations
place names and names of people and organizations involved in politics or crime
political/legal events
Negative Logits
Token        Logit
_            -0.42
seu          -0.38
Prev         -0.37
förm         -0.37
previous     -0.36
cat          -0.36
<eos>        -0.35
cabe         -0.35
A            -0.35
ce           -0.34
Positive Logits
Token        Logit
Efq          0.95
myſelf       0.91
Monfieur     0.91
itſelf       0.90
ſelf         0.88
themſelves   0.87
Houſe        0.86
ſind         0.85
correctes    0.84
houſe        0.84
Activations Density: 4.046%