INDEX
NEGATIVE LOGITS
Token      Logit
estern     -0.75
anu        -0.75
ãĥ´        -0.67
acting     -0.65
plex       -0.64
entimes    -0.64
ãĥĩ        -0.63
Unch       -0.62
mental     -0.62
ãĥ¼ãĥĨ     -0.60
POSITIVE LOGITS
Token      Logit
"...       0.97
"â̦        0.90
:"         0.90
"[         0.87
""         0.84
"'         0.82
"#         0.81
:          0.78
".         0.78
"(         0.77
Activations Density 0.116%