INDEX
Explanations
Japanese names, primarily with "iko" at the end
mentions of individuals, particularly in a Japanese context
Negative Logits
mistaken          -0.73
partnerships      -0.72
roads             -0.68
drawn             -0.65
misunderstanding  -0.64
Correct           -0.63
marrow            -0.62
miscar            -0.62
campuses          -0.62
reconciliation    -0.61
Positive Logits
iko     1.21
ichi    0.94
omi     0.93
imura   0.86
itsu    0.85
oko     0.84
obi     0.83
oto     0.83
ira     0.82
iro     0.81
Activations Density 0.007%
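
A minimal sketch of how a density figure like the one above could be computed. It assumes density means the fraction of token positions on which the feature's activation is above zero; the page does not state its definition, so that reading is an assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions where the feature
    fires; the dashboard's actual definition is not given in the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the feature fires on 7 of 100,000 token positions.
sample = [0.0] * 99_993 + [1.5] * 7
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.007% corresponds to roughly 7 firing positions per 100,000 tokens, which fits a feature tied to a narrow pattern such as Japanese name suffixes.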