INDEX
Explanations
proper nouns or names starting with 'Ny' or 'Vor'
specific names or terms, particularly people's names
New Auto-Interp
Negative Logits
eering
-1.02
ably
-0.84
IBLE
-0.74
smoking
-0.71
ÙĦ
-0.71
ablishment
-0.68
TAIN
-0.68
urdue
-0.68
________________________________
-0.67
ï
-0.66
POSITIVE LOGITS
Ny
1.30
quist
0.87
borg
0.87
ota
0.86
©¶æ¥µ
0.82
yk
0.81
acht
0.75
©¶æ
0.75
wana
0.75
yss
0.75
Activations Density 0.004%