INDEX
Explanations
comparative adjectives describing the degree or intensity of something
comparative phrases emphasizing significance or frequency
Negative Logits
ĺħ        -0.69
¬¼        -0.63
çļ        -0.61
etts      -0.58
}}}       -0.58
mberg     -0.57
ritz      -0.57
sembly    -0.55
odor      -0.54
osate     -0.54
Positive Logits
than      2.43
than      2.29
Than      1.91
Th        0.89
TH        0.85
then      0.78
then      0.76
THEN      0.76
worldly   0.72
besides   0.66
Activations Density: 0.521%