INDEX
Explanations
references to literacy and literature
references to literacy and its various forms
New Auto-Interp
Negative Logits
Sands
-0.73
Parenthood
-0.69
swer
-0.69
Ĥª
-0.65
cffffcc
-0.62
ktop
-0.61
IELD
-0.60
rontal
-0.60
ressing
-0.59
palms
-0.59
POSITIVE LOGITS
atures
1.28
acy
1.10
acies
1.08
uania
1.04
ature
0.96
umen
0.91
liter
0.88
ocity
0.85
ging
0.85
liter
0.84
Activations Density 0.020%