INDEX
Explanations
Greek letters and symbols
special characters and symbols, particularly those that resemble currency or mathematical notation
Negative Logits
poaching    -0.75
Lauder      -0.74
wildlife    -0.74
blacklist   -0.72
Brow        -0.69
orno        -0.69
timely      -0.68
iage        -0.67
hower       -0.67
drawer      -0.66
Positive Logits
ο     2.11
ÏĦ    2.08
Ï     2.06
Î     2.06
α     2.05
κ     2.02
λ     2.01
ν     2.01
ι     1.98
Ïģ    1.96
Activations Density 0.025%