INDEX
Explanations
the word "if" followed by a numerical value
conditional statements or hypothetical scenarios
Negative Logits
depths      -0.69
omi         -0.65
GMT         -0.64
WAYS        -0.64
================================================================  -0.63
holm        -0.62
nect        -0.61
avor        -0.61
grey        -0.60
ho          -0.59
Positive Logits
yip         0.93
fy          0.82
Gutenberg   0.76
thou        0.75
you         0.74
ever        0.69
Melania     0.68
rame        0.67
yon         0.67
soever      0.67
Activations Density 0.033%