INDEX
Explanations
phrases indicating putting an end or taking action on something
articles preceding nouns
New Auto-Interp
Negative Logits
assisted
-0.70
!/
-0.69
endeavors
-0.63
preferring
-0.61
incorpor
-0.60
peak
-0.60
lins
-0.59
mins
-0.58
pursuits
-0.58
Easy
-0.58
POSITIVE LOGITS
lot
1.21
bunch
1.12
handful
0.98
huge
0.89
few
0.89
sizable
0.87
sizeable
0.85
bead
0.84
piece
0.83
couple
0.82
Activations Density 0.211%