INDEX
Explanations
phrases related to personal beliefs and life decisions
New Auto-Interp
Negative Logits
chin
-0.74
iple
-0.68
ania
-0.67
coni
-0.67
utive
-0.65
lier
-0.64
chell
-0.62
quartered
-0.62
iny
-0.61
ighed
-0.60
POSITIVE LOGITS
albeit
1.37
regardless
1.18
irrespective
1.11
yeah
1.02
insofar
1.00
anyway
0.98
though
0.97
although
0.97
lest
0.93
preferably
0.93
Activations Density 0.313%