INDEX
Explanations
verbs related to actions of taking, such as "takes," "took," "taking," and "taken."
New Auto-Interp
Negative Logits
lite
-0.69
raid
-0.69
è¦ļéĨĴ
-0.67
ILCS
-0.65
gian
-0.64
eous
-0.62
lex
-0.62
Conclusion
-0.60
arie
-0.60
constitu
-0.60
POSITIVE LOGITS
advantage
1.16
refuge
0.96
pains
0.95
aim
0.94
um
0.85
charge
0.83
place
0.82
part
0.82
inspiration
0.82
aback
0.81
Activations Density 0.074%