INDEX
Explanations
references to specific quantities or levels
phrases that consistently include the article "a" along with other references to specific contexts or situations
New Auto-Interp
Negative Logits
onis
-0.73
APTER
-0.70
apters
-0.69
eur
-0.67
Aires
-0.67
ORTS
-0.66
imaru
-0.64
tt
-0.64
osion
-0.63
Ending
-0.61
POSITIVE LOGITS
glance
1.25
distance
1.00
rate
0.90
snail
0.88
disadvantage
0.86
time
0.86
cost
0.84
minimum
0.82
discounted
0.81
fraction
0.81
Activations Density 0.050%