INDEX
Explanations
numerical values appearing in a structured format
measurements and quantities in various contexts
Negative Logits
phrine      -0.94
Halls       -0.82
theless     -0.73
deen        -0.66
GOODMAN     -0.65
hower       -0.64
thirds      -0.63
Galile      -0.62
kitchens    -0.61
Rumble      -0.60
Positive Logits
icago        0.97
.,           0.95
./           0.82
........     0.81
ickr         0.80
Avg          0.78
hered        0.77
emp          0.77
nyder        0.76
iances       0.75
Activations Density: 0.023%