INDEX
Explanations
phrases related to emotional or reflective experiences
Sentences followed by conjunctions or continuations
possibilities and questions
New Auto-Interp
Negative Logits
(;;)
-0.78
saraba
-0.64
клопе
-0.63
glGen
-0.63
ungeon
-0.61
wikipagina
-0.61
?】
-0.60
Cited
-0.60
THISDAY
-0.59
/>";
-0.59
POSITIVE LOGITS
kasarigan
0.63
چقدر
0.57
impost
0.56
Damn
0.51
Thinking
0.51
damn
0.50
maybe
0.50
thought
0.49
ImageContext
0.49
should
0.48
Activations Density 0.107%