INDEX
Explanations
proper nouns, particularly names
names and references related to specific individuals and factors connected to them
New Auto-Interp
Negative Logits
bre
-0.74
arov
-0.73
ĸļ
-0.73
igel
-0.70
ãĥĩãĤ£
-0.68
berra
-0.67
lies
-0.67
uyomi
-0.67
encers
-0.66
enza
-0.66
POSITIVE LOGITS
LR
1.12
LR
1.02
Lank
0.75
SECTION
0.74
SE
0.72
RAM
0.70
riott
0.69
andom
0.66
NX
0.66
outine
0.66
Activations Density 0.022%