INDEX
Explanations
specific characters or character sequences that match a particular pattern
special characters and symbols typically used in non-English scripts or encodings
Negative Logits
Heath       -0.72
SPONSORED   -0.67
Starr       -0.66
Daly        -0.66
Hyde        -0.65
Hastings    -0.63
ORED        -0.62
Faw         -0.62
Barton      -0.61
Worldwide   -0.61
Positive Logits
Ð    1.49
Ŀ    1.31
±    1.19
ļ    1.19
Ķ    1.18
¹    1.17
ł    1.15
ij    1.13
Ĺ    1.13
Ł    1.12
Activations Density 0.004%