Explanations
phrases indicating a statement or opinion on a topic
phrases that express a statement or assertion
Negative Logits
xtap        -0.85
¥ŀ          -0.76
hooting     -0.75
asu         -0.74
pes         -0.72
phrine      -0.72
mental      -0.71
conflic     -0.71
tele        -0.70
Tai         -0.69
Positive Logits
goodbye         1.38
hello           1.03
aloud           0.89
Goodbye         0.87
farewell        0.78
ysis            0.74
ieu             0.71
definitively    0.69
sorry           0.68
ings            0.64
Activations Density 0.049%