INDEX
Explanations
instances where the word 'nearly' is used before a number
phrases that indicate approximate quantities or degrees of frequency
New Auto-Interp
Negative Logits
oris
-0.80
agate
-0.76
Reviewer
-0.76
Provision
-0.70
Dynamics
-0.69
ãĤ¸
-0.67
ieu
-0.67
Reward
-0.67
locality
-0.66
ysis
-0.65
POSITIVE LOGITS
arser
0.75
PsyNetMessage
0.72
identical
0.71
tripled
0.71
ceed
0.68
anus
0.67
ident
0.65
doubled
0.64
MpServer
0.63
200
0.63
Activations Density 0.023%