INDEX
Explanations
phrases indicating superiority or addition
phrases indicating an ascending ranking or hierarchy
Negative Logits
chie        -0.69
ELL         -0.65
Strait      -0.62
isher       -0.62
Breed       -0.58
Bye         -0.57
cci         -0.57
Guinness    -0.57
jab         -0.57
ITED        -0.56
Positive Logits
boards      0.87
thereof     0.86
oft         0.86
of          0.83
ography     0.79
most        0.78
mast        0.77
deck        0.76
retty       0.75
Of          0.73
Activations Density 0.022%