INDEX
Explanations
measurements or quantities indicated by a number followed by a unit of measurement
numerical data related to quantities and measurements
New Auto-Interp
Negative Logits
£ı
-0.68
Reviewer
-0.65
Exodus
-0.64
vanity
-0.64
liction
-0.62
thesis
-0.62
vain
-0.60
fame
-0.60
Heller
-0.60
arena
-0.59
POSITIVE LOGITS
½
1.26
½
0.99
90
0.95
PO
0.91
500
0.90
200
0.89
¢
0.83
800
0.83
472
0.83
bp
0.83
Activations Density 0.188%