Explanations
conditions or hypothetical situations described with an "if" clause
conditional statements
Negative Logits
emis        -0.62
tten        -0.59
Born        -0.58
assadors    -0.53
ANI         -0.51
AMY         -0.51
atari       -0.51
tained      -0.51
agnetic     -0.51
atro        -0.50
Positive Logits
if           2.91
unless       2.02
if           1.90
If           1.63
IF           1.58
If           1.58
unless       1.54
whether      1.40
whenever     1.34
depending    1.33
Activations Density: 0.102%