INDEX
Explanations
proper nouns related to Japanese culture or names
proper nouns, particularly names and places
Negative Logits
Jenner       -0.83
illard       -0.74
Richards     -0.72
wards        -0.72
Bills        -0.70
Pry          -0.67
arations     -0.66
town         -0.66
Bulg         -0.66
Dillon       -0.66
Positive Logits
ibaba        0.90
imaru        0.79
Û            0.74
ikawa        0.74
=-=-         0.74
igslist      0.73
ogene        0.73
ococ         0.72
bach         0.72
ItemTracker  0.71
Activations Density: 0.031%