INDEX
Explanations
conditional statements starting with "Had"
occurrences of the word "Had" in various contexts
New Auto-Interp
Negative Logits
outp
-0.65
FTWARE
-0.64
scrimmage
-0.62
sniper
-0.59
Rated
-0.58
repay
-0.58
âϦ
-0.57
embr
-0.57
juggling
-0.57
salute
-0.56
POSITIVE LOGITS
iths
1.06
rons
0.92
ith
0.91
hers
0.90
luck
0.87
nesday
0.84
been
0.82
rontal
0.82
dit
0.82
alus
0.80
Activations Density 0.067%