INDEX
Explanations
phrases that express degrees of quality or value
New Auto-Interp
Negative Logits
åĺī
-0.17
ži
-0.16
imar
-0.15
acz
-0.15
osu
-0.15
agar
-0.14
fabs
-0.14
اÙĦا
-0.14
Somehow
-0.14
allah
-0.13
POSITIVE LOGITS
deal
0.78
Deal
0.65
deal
0.65
Deal
0.59
DEAL
0.54
deals
0.52
dealt
0.49
bit
0.44
Deals
0.44
dealing
0.40
Activations Density 0.036%