INDEX
Explanations
phrases indicating a binary decision or a choice between two possibilities
conditional phrases indicating uncertainty or indecision
New Auto-Interp
Negative Logits
çīĪ
-0.71
ĸļ
-0.69
Rooms
-0.67
ãĤ¼ãĤ¦ãĤ¹
-0.66
äºĶ
-0.66
083
-0.65
¿
-0.65
ãĥķãĤ©
-0.65
SourceFile
-0.65
srfAttach
-0.62
POSITIVE LOGITS
technically
0.84
existed
0.81
swayed
0.77
qualifies
0.77
theless
0.76
mete
0.72
necessarily
0.71
they
0.71
qualified
0.69
igham
0.68
Activations Density 0.018%