INDEX
Explanations
references to the word "first" in various contexts
New Auto-Interp
Negative Logits
reszcie
-0.71
%)$
-0.71
rrggbb
-0.68
Winaray
-0.64
PyExc
-0.60
finally
-0.60
lastly
-0.59
rawDesc
-0.59
exaggeration
-0.57
hésite
-0.56
POSITIVE LOGITS
few
0.83
born
0.81
responders
0.80
thing
0.79
aider
0.78
aid
0.77
impression
0.74
ever
0.73
Aid
0.73
glance
0.73
Activations Density 0.150%