INDEX
Explanations
the word "reality" and words that can be contrasted with it like "everywhere" or "tomorrow"
Negative Logits
Token         Logit
"])           -1.29
ویکیپدیای     -1.28
varandra      -1.27
itſelf        -1.26
للمعارف       -1.25
vician        -1.22
']")          -1.21
$.            -1.20
.)}           -1.20
".            -1.20
Positive Logits
Token         Logit
.             0.78
form          0.67
,             0.63
dom           0.62
di            0.61
pos           0.61
som           0.60
em            0.59
sal           0.58
\             0.57
Activations Density: 1.519%