INDEX
Explanations
dates in a specific format
sequences of letters and characters, particularly patterns in titles or acronyms
New Auto-Interp
Negative Logits
Materials
-0.64
brim
-0.63
poke
-0.63
caps
-0.62
dos
-0.61
tip
-0.61
Bloomberg
-0.61
Flavoring
-0.60
Shape
-0.60
picture
-0.59
POSITIVE LOGITS
hyde
0.70
oire
0.69
.:
0.67
culosis
0.67
.,
0.66
.?
0.65
ament
0.65
eteria
0.62
acus
0.61
.—
0.61
Activations Density 0.071%