INDEX
Explanations
references to previous experiences or comparisons over time
New Auto-Interp
Negative Logits
icken
-0.79
bb
-0.69
fw
-0.69
bard
-0.69
okers
-0.69
motion
-0.68
okes
-0.68
externalActionCode
-0.68
tip
-0.68
mouth
-0.67
POSITIVE LOGITS
previous
1.04
usual
0.91
norm
0.83
baseline
0.80
whence
0.80
traditional
0.80
typical
0.78
scrimmage
0.77
earlier
0.75
conventional
0.74
Activations Density 0.042%