INDEX
Explanations
phrases related to advantages or positive outcomes
Negative Logits
il          -0.16
rib         -0.15
esp         -0.15
coming      -0.14
wich        -0.14
поч         -0.14
idelity     -0.14
ie          -0.14
for         -0.14
imar        -0.14
Positive Logits
ting            0.21
benefit         0.18
benefited       0.17
GuidId          0.17
RuntimeObject   0.16
Benefit         0.15
enefit          0.15
greatly         0.15
lien            0.15
しの            0.15
Activations Density 0.023%