INDEX
Explanations
phrases related to advocating or raising awareness about various issues
Negative Logits
Token        Logit
cas          -0.68
ician        -0.68
treatment    -0.67
pedia        -0.66
Healer       -0.59
dest         -0.58
obs          -0.58
melon        -0.58
Antar        -0.58
nar          -0.57
Positive Logits
Token          Logit
eyebrows       1.41
eyebrow        1.05
awareness      1.05
suspicions     0.96
alarms         0.94
doubts         0.91
expectations   0.89
stakes         0.88
taxes          0.86
objections     0.83
Activations Density: 0.566%