Explanations
discoveries, controversies, and events reported in texts
punctuation marks and symbols at the end of sentences or questions
Negative Logits
lish       -0.71
ilee       -0.68
treasury   -0.64
Gur        -0.64
tenant     -0.62
arded      -0.62
ified      -0.59
cour       -0.59
eyeb       -0.59
yright     -0.59
Positive Logits
pmwiki               0.86
ï¸                   0.84
BILITIES             0.82
[+                   0.81
ĸļ                   0.80
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³   0.78
Unloaded             0.78
Parables             0.74
âķ                   0.74
è¦ļéĨĴ               0.73
Activations Density: 0.065%