INDEX
Explanations
phrases indicating belief or perception
the verb "to be" in various forms and contexts in the text
New Auto-Interp
Negative Logits
Scroll
-0.80
pedia
-0.74
Salon
-0.73
Verge
-0.69
ilst
-0.66
Gaul
-0.66
Britons
-0.66
HuffPost
-0.63
resorts
-0.63
Investigative
-0.60
POSITIVE LOGITS
gonna
1.29
ELF
0.94
funny
0.94
okay
0.89
going
0.85
ok
0.85
happening
0.84
leeve
0.81
wrong
0.80
gotta
0.79
Activations Density 0.216%