INDEX
Explanations
instances of the verb "take" and its variations in different contexts
Negative Logits
fad     -0.16
yen     -0.16
ment    -0.15
eter    -0.15
ala     -0.15
ame     -0.15
acam    -0.14
ader    -0.14
iggs    -0.14
iff     -0.14
Positive Logits
chances      0.26
risks        0.25
risk         0.18
Chance       0.17
Chance       0.17
_chance      0.17
rủi          0.16
Ris          0.16
ebek         0.16
shortcuts    0.15
Activations Density 0.053%