INDEX
Explanations
phrases related to meeting a certain standard or expectation
references to a mark or notable achievement
New Auto-Interp
Negative Logits
ILLE
-0.78
æ©Ł
-0.67
amily
-0.63
foreseen
-0.60
traged
-0.59
eger
-0.59
Franch
-0.59
amera
-0.59
aband
-0.58
pand
-0.58
POSITIVE LOGITS
manship
1.32
emark
1.00
downs
0.97
down
0.97
Twain
0.83
ename
0.83
marks
0.81
ups
0.79
posts
0.75
vich
0.73
Activations Density 0.020%