INDEX
Explanations
conditional statements related to taking an action or making a decision
phrases that express taking action or making decisions
New Auto-Interp
Negative Logits
holm
-0.67
Cong
-0.67
agre
-0.66
gian
-0.64
ingen
-0.63
eers
-0.63
Smile
-0.61
eman
-0.60
Kamp
-0.59
Domain
-0.58
POSITIVE LOGITS
advantage
1.18
aways
1.13
heed
0.96
aback
0.90
care
0.88
refuge
0.83
autions
0.82
overs
0.81
OVER
0.80
precedence
0.79
Activations Density 0.120%