INDEX
Explanations
company names in text
negations or statements disproving a claim
Negative Logits
Palest       -0.71
photoc       -0.67
çīĪ          -0.65
ilst         -0.62
indo         -0.62
sacrific     -0.61
cryptoc      -0.61
Fukushima    -0.60
Nik          -0.60
mainland     -0.59
Positive Logits
¬            1.32
Ń            1.27
±            1.20
«            1.18
ĵ            1.16
ĸ            1.15
ij           1.14
ª            1.13
£            1.12
Ķ            1.12
Activations Density: 0.136%