INDEX
Explanations
references to Japanese culture or individuals
Negative Logits
Token               Logit
حوالہ               -0.37
numerusform         -0.35
culoare             -0.34
簗                  -0.33
Atiku               -0.33
Tahoe               -0.33
zyg                 -0.33
(no visible token)  -0.33
ZIE                 -0.31
Khartoum            -0.31
Positive Logits
Token               Logit
Japanese            1.73
Japan               1.72
Japón               1.52
Japan               1.52
Jepang              1.50
japan               1.46
Japanese            1.46
JAPAN               1.44
Japon               1.43
япон                1.43
Activations Density: 1.070%