INDEX
Explanations
phrases related to opinions or perspectives
phrases related to public sentiment and policy discussion
New Auto-Interp
Negative Logits
Flavoring
-0.71
éŃĶ
-0.66
kefeller
-0.64
ickets
-0.60
Hau
-0.59
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.57
onductor
-0.57
fibre
-0.56
racuse
-0.56
Merit
-0.56
POSITIVE LOGITS
!?
0.81
????????
0.81
????
0.80
?!
0.76
???
0.76
âĶĢâĶĢâĶĢâĶĢ
0.75
huh
0.73
...?
0.73
plet
0.71
"?
0.71
Activations Density 1.122%