INDEX
Explanations
references to authority figures and their roles or statements
Comes after a name or title
names after commas
New Auto-Interp
Negative Logits
du
-0.51
PR
-0.46
z
-0.46
SC
-0.45
Hotspur
-0.45
PR
-0.44
java
-0.44
.
-0.43
ব
-0.43
pae
-0.43
POSITIVE LOGITS
Monfieur
1.09
########.
1.03
متعلقه
1.02
auffi
1.02
TagMode
1.00
iſt
0.96
Efq
0.95
ſelf
0.95
parsedMessage
0.95
#+#
0.94
Activations Density 0.714%