INDEX
Explanations
specific Japanese characters or phrases related to emotions and states of being
New Auto-Interp
Negative Logits
in
-0.75
mxArray
-0.70
ValueStyle
-0.61
국의
-0.57
étape
-0.57
in
-0.56
a
-0.55
a
-0.54
from
-0.54
thenia
-0.53
POSITIVE LOGITS
Reſ
1.09
Inſ
1.02
Conſ
1.01
Anſ
1.00
Diſ
0.96
Efq
0.95
Houſe
0.91
myſelf
0.90
Perſ
0.87
ſtate
0.87
Activations Density 0.274%