INDEX
Explanations
adjectives or verbs indicating value judgments
the verb "is" and its variations in different contexts
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.71
ABE
-0.71
ONSORED
-0.69
¥ŀ
-0.66
ADS
-0.63
Buff
-0.63
ORK
-0.61
Styles
-0.59
banner
-0.59
Ĥª
-0.58
POSITIVE LOGITS
estate
0.90
sel
0.90
quel
0.88
ciation
0.88
cience
0.88
pect
0.87
earch
0.87
sei
0.87
quer
0.87
ceed
0.86
Activations Density 0.051%