INDEX
Explanations
names related to Japanese culture
references to specific individuals, particularly names
Negative Logits
Token         Logit
Scotia        -0.80
neut          -0.68
Commonwealth  -0.65
ICE           -0.61
Capitals      -0.61
eering        -0.60
Unic          -0.60
RCMP          -0.60
Redux         -0.60
Rebels        -0.59
Positive Logits
Token         Logit
amoto         1.74
azaki         1.62
amura         1.45
imura         1.43
ihara         1.39
asaki         1.39
imoto         1.38
ikawa         1.32
akura         1.24
aido          1.21
Activations Density: 0.038%