INDEX
Explanations
locations, names of cities, and surnames
proper nouns, particularly names and locations
New Auto-Interp
Negative Logits
reddits
-0.89
aji
-0.82
ask
-0.80
atur
-0.77
enium
-0.76
ivas
-0.74
imes
-0.74
amara
-0.73
aken
-0.73
shire
-0.73
POSITIVE LOGITS
Strait
0.83
Pavilion
0.75
nomine
0.75
jad
0.71
Hayward
0.66
Cors
0.65
CLASSIFIED
0.63
BIL
0.63
Townsend
0.62
furt
0.62
Activations Density 0.066%