INDEX
Explanations
characters that are not part of standard English text
special characters or non-standard symbols in the text
New Auto-Interp
Negative Logits
alion
-0.76
icter
-0.72
Norn
-0.71
iple
-0.68
aimon
-0.67
plaque
-0.65
apers
-0.65
atre
-0.65
ulhu
-0.65
ified
-0.64
POSITIVE LOGITS
¢
1.04
ÃĤ
1.00
©
0.95
âĢ¢âĢ¢
0.89
£
0.83
âĤ¬
0.81
··
0.80
Johnson
0.79
¯¯
0.79
§
0.77
Activations Density 0.008%