Explanations
references to rankings or numerical positions
punctuation marks, particularly periods
Negative Logits
lag              -0.64
etheless         -0.60
staking          -0.60
ãĥ¼ãĥĨ           -0.59
confidentiality  -0.55
luc              -0.54
disturbed        -0.54
enthus           -0.54
pear             -0.54
herds            -0.52
Positive Logits
1                0.91
2                0.84
1                0.81
3                0.80
8                0.77
4                0.76
7                0.76
6                0.75
22               0.75
5                0.73
Activations Density: 0.024%