INDEX
Explanations
numerical figures, symbols, and formatting elements in text
text segments with special characters or formatting symbols
New Auto-Interp
Negative Logits
seless
-0.77
acles
-0.75
fireplace
-0.74
ahime
-0.73
inator
-0.71
unia
-0.70
Beir
-0.69
nesses
-0.68
anse
-0.68
liness
-0.64
POSITIVE LOGITS
(*
0.81
(*
0.79
ERROR
0.76
STD
0.74
STDOUT
0.73
testing
0.73
TEXT
0.71
catentry
0.70
Thompson
0.69
FREE
0.69
Activations Density 0.025%