INDEX
Explanations
phrases where someone is convinced of something
phrases focused on strong beliefs or convictions
New Auto-Interp
Negative Logits
dos
-0.72
launch
-0.70
orrow
-0.67
Occupations
-0.67
regulated
-0.66
pmwiki
-0.66
umbn
-0.64
asks
-0.63
ember
-0.63
itage
-0.63
POSITIVE LOGITS
convinced
1.23
persuaded
1.00
convince
0.97
-+-+
0.92
convin
0.90
persuade
0.82
lured
0.79
ãĥ¼ãĥĨ
0.78
ãĥ³ãĤ¸
0.75
rican
0.73
Activations Density 0.011%