INDEX
Explanations
phrases that express emotions, achievements, or significant moments related to personal experiences
Negative Logits
igin    -0.16
ije     -0.15
apon    -0.15
igi     -0.14
//{{    -0.14
aña     -0.14
uito    -0.13
кÑĤ     -0.13
edio    -0.13
IGO     -0.13
Positive Logits
[       0.20
[s      0.16
[$      0.15
[â̦]    0.15
['      0.15
[,]     0.15
den     0.14
lash    0.14
[%      0.14
ï¼»     0.13
Activations Density 0.478%