INDEX
Explanations
the term "important" and its variations, indicating a focus on significant concepts or elements within the text
New Auto-Interp
Negative Logits
uffers
-0.16
913
-0.15
onica
-0.14
onical
-0.14
czy
-0.14
á»ĵn
-0.14
_preferences
-0.14
æĦ
-0.14
criptor
-0.13
754
-0.13
POSITIVE LOGITS
antly
0.20
ölçüde
0.18
ost
0.16
ially
0.16
/key
0.16
ξη
0.15
-league
0.15
pants
0.15
جدا
0.15
/use
0.15
Activations Density 0.041%