INDEX
Explanations
phrases related to conditional statements
conditional statements or phrases indicating a hypothetical situation
New Auto-Interp
Negative Logits
abre
-0.46
forth
-0.45
Crusade
-0.43
Ire
-0.43
�
-0.41
multitude
-0.41
hallmark
-0.41
herry
-0.41
Enh
-0.41
Hun
-0.40
POSITIVE LOGITS
rame
1.08
ihad
0.98
you
0.88
ornia
0.86
fy
0.68
thou
0.66
anybody
0.63
you
0.63
(!
0.63
ram
0.63
Activations Density 0.120%