Explanations
proper nouns related to individuals, particularly names
Negative Logits
Token            Logit
izoph            -0.55
polar            -0.54
Xi               -0.54
unsustainable    -0.54
apex             -0.53
Scotland         -0.53
Temperature      -0.53
entail           -0.52
unfavorable      -0.51
ultras           -0.51
Positive Logits
Token            Logit
Jr               1.14
baum             1.00
witz             0.96
owski            0.94
meyer            0.93
berger           0.92
lein             0.91
baugh            0.91
owicz            0.90
bach             0.89
Activations Density: 0.170%