INDEX
Explanations
phrases related to movements or actions
specific actions or events related to personal experiences and milestones
New Auto-Interp
Negative Logits
far
-0.66
notwithstanding
-0.66
aver
-0.64
00007
-0.64
inen
-0.62
¶ħ
-0.62
Nanto
-0.61
exacerbated
-0.59
mong
-0.58
among
-0.58
POSITIVE LOGITS
uberty
0.86
enium
0.67
robe
0.65
],"
0.65
ascus
0.62
exams
0.62
pection
0.62
ript
0.62
jen
0.60
azeera
0.60
Activations Density 0.852%