INDEX
Explanations
phrases starting with "Last but not least"
phrases or constructs that include the word "but."
New Auto-Interp
Negative Logits
tnc
-0.64
Rolls
-0.61
ige
-0.61
Liberties
-0.60
rongh
-0.60
naire
-0.59
oire
-0.58
agra
-0.56
ories
-0.56
Combine
-0.56
POSITIVE LOGITS
tons
1.25
chery
1.20
chers
0.93
ler
0.88
alas
0.87
LER
0.83
tered
0.82
tery
0.81
tern
0.79
luckily
0.78
Activations Density 0.173%