INDEX
Explanations
Japanese names, potentially related to locations or individuals
the names of individuals associated with a specific context or event
New Auto-Interp
Negative Logits
Rudd
-0.66
Lynd
-0.65
markup
-0.64
Redux
-0.63
residual
-0.62
pollut
-0.61
pasture
-0.61
forwards
-0.59
straw
-0.58
view
-0.57
POSITIVE LOGITS
achi
4.67
iba
1.40
hiba
1.28
achu
1.20
uku
1.13
agi
1.08
aeda
1.07
ichi
1.03
anka
1.01
atsu
1.01
Activations Density 0.007%