INDEX
Explanations
references to specific individuals, possibly including names and titles
New Auto-Interp
Negative Logits
ãģĨ
-1.01
ISC
-1.01
mberg
-0.99
Across
-0.98
nikov
-0.91
giving
-0.83
âĶģ
-0.83
Enabled
-0.83
abiding
-0.82
anwhile
-0.82
POSITIVE LOGITS
cci
1.19
pling
1.18
ching
1.17
plet
1.15
alog
1.14
Pont
1.13
gey
1.13
iple
1.08
illard
1.08
pee
1.07
Activations Density 0.774%