Explanations
verbs in the past tense
phrases indicating choices or decisions
Negative Logits
ench         -0.70
lat          -0.68
resa         -0.66
atha         -0.65
ciating      -0.64
amina        -0.63
clad         -0.63
dimension    -0.63
anon         -0.62
esc          -0.62
Positive Logits
decided       0.97
decides       0.86
Ń·            0.84
decide        0.82
ãģ®éŃĶ        0.81
unanimously   0.78
upon          0.74
Xiaomi        0.72
($)           0.71
à©            0.71
Activations Density 0.017%
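
Several of the positive-logit tokens (Ń·, ãģ®éŃĶ, à©) look garbled but are probably raw vocabulary strings rather than encoding damage. The sketch below assumes the underlying model uses a GPT-2 style byte-level BPE vocabulary, which this page does not state; under that assumption each displayed character stands for one byte, and the mapping can be inverted to recover the bytes. The function names are hypothetical helpers written for this illustration, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Byte -> display-character table in the style of GPT-2 byte-level BPE.

    Assumption: the dashboard shows tokens in this encoding.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points 256 and above.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: display character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Recover the raw bytes behind a displayed vocabulary string."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


def decode_token(token):
    """Decode a displayed token to text; incomplete UTF-8 becomes U+FFFD."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


print(decode_token("ãģ®éŃĶ"))       # の魔
print(token_to_bytes("Ń·").hex())   # adb7
print(token_to_bytes("à©").hex())   # e0a9
```

If the assumption holds, ãģ®éŃĶ is a complete two-character Japanese token, while Ń· and à© are fragments of multi-byte characters that only become readable once joined with neighboring tokens.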