INDEX
Explanations
dates and numerical figures expressed in a specific format
numerical data and statistics
New Auto-Interp
Negative Logits
fateful
-0.72
nomine
-0.69
shopping
-0.68
sidew
-0.68
corrid
-0.67
bunny
-0.67
thous
-0.66
tremend
-0.65
agna
-0.63
canned
-0.63
POSITIVE LOGITS
307
0.94
MQ
0.93
285
0.92
88
0.91
245
0.91
195
0.90
287
0.90
665
0.89
284
0.88
484
0.88
Activations Density 0.144%