Explanations
references to academic studies and research
Negative Logits
Rapids       -0.68
Pole         -0.63
Fiesta       -0.62
roots        -0.61
riot         -0.60
groceries    -0.60
lookout      -0.60
cakes        -0.59
gets         -0.58
airst        -0.58
Positive Logits
conducted    1.15
involving    1.03
performed    1.02
studies      0.94
examining    0.94
uggest       0.94
undertaken   0.93
published    0.90
evaluating   0.90
comparing    0.88
Activations Density 0.033%