Explanations
phrases indicating frequency or repetition
Negative Logits
Token             Logit
reesome           -0.16
.scalablytyped    -0.15
alytics           -0.15
ải                -0.14
edik              -0.14
ارک               -0.14
æĭĵ               -0.14
dif               -0.14
paged             -0.13
angent            -0.13
Positive Logits
Token             Logit
awhile            0.36
blue              0.30
while             0.29
blue              0.26
Blue              0.25
BLUE              0.24
BLUE              0.23
-blue             0.23
while             0.23
Blue              0.23
Activations Density: 0.031%