INDEX
Explanations
phrases indicating events or actions that have occurred or been announced by a specific entity
the repeated use of the word "this" in various contexts
New Auto-Interp
Negative Logits
tops
-0.69
worms
-0.66
doms
-0.66
masters
-0.65
76561
-0.64
Sword
-0.63
okers
-0.62
ovy
-0.62
ãĤ¹ãĥĪ
-0.60
©¶æ¥µ
-0.59
POSITIVE LOGITS
week
1.02
month
0.91
year
0.90
morning
0.86
weekend
0.86
latest
0.85
afternoon
0.83
century
0.80
particular
0.79
decade
0.77
Activations Density 0.248%