INDEX
Explanations
references to Chinese or Japanese identity or culture
Chinese and Japanese nationalities
New Auto-Interp
Negative Logits
wymiar
-0.63
tričko
-0.56
awtextra
-0.54
вгений
-0.54
ltä
-0.53
kapturem
-0.53
Nachmittag
-0.52
sentiers
-0.52
fromnode
-0.52
torebka
-0.52
POSITIVE LOGITS
Chinese
0.68
Japanese
0.65
Chinese
0.59
s
0.58
Japanese
0.57
inese
0.57
Swiss
0.56
Irish
0.56
ese
0.55
оригіналу
0.55
Activations Density 0.030%