INDEX
Explanations
political and societal commentary related phrases
instances of the character "âĢ" and related symbols
New Auto-Interp
Negative Logits
sled
-0.72
Franch
-0.69
Manhattan
-0.63
Paran
-0.63
unmarked
-0.62
isode
-0.61
velvet
-0.60
antip
-0.60
Mutual
-0.59
Cookie
-0.59
POSITIVE LOGITS
¬
1.39
Ļ
1.37
¡
1.25
ľ
1.21
Ĵ
1.19
ĸ
1.15
ı
1.15
µ
1.14
¤
1.11
Ĺ
1.11
Activations Density 0.336%