INDEX
Explanations
references to specific individuals and their accomplishments
proper nouns, specifically names and locations
New Auto-Interp
Negative Logits
japanese
-0.67
Japanese
-0.67
VAO
-0.65
япон
-0.63
Japanese
-0.61
japonais
-0.61
Référence
-0.58
日本人
-0.54
enumi
-0.54
lenker
-0.53
POSITIVE LOGITS
Hank
0.65
Okay
0.62
Hank
0.59
Send
0.57
VIAF
0.55
setopt
0.54
osus
0.52
Okay
0.51
Send
0.49
ományos
0.49
Activations Density 0.204%