INDEX
Explanations
references to specific numbered items or entities within a list
special character patterns or formatting elements
New Auto-Interp
Negative Logits
Pioneer
-0.80
Cran
-0.65
courier
-0.65
Paradise
-0.64
Patrol
-0.62
iens
-0.61
seller
-0.60
Polar
-0.59
Anglo
-0.59
Evangel
-0.58
POSITIVE LOGITS
###
4.48
####
2.77
##
2.64
###
2.36
########
2.24
################
1.92
##
1.72
################################
1.60
#####
1.55
#
1.37
Activations Density 0.033%