INDEX
Explanations
large numerical values, specifically the word "thousand"
references to the phrase "a thousand."
New Auto-Interp
Negative Logits
odcast
-0.88
akening
-0.88
livious
-0.85
enture
-0.82
rica
-0.82
NetMessage
-0.81
inion
-0.79
regon
-0.79
untu
-0.78
enhagen
-0.76
POSITIVE LOGITS
oxy
0.81
snakes
0.72
yen
0.67
Ake
0.65
injection
0.64
stripes
0.64
lions
0.63
Okin
0.63
cubic
0.62
miles
0.62
Activations Density 0.029%