INDEX
Explanations
numbers and monetary values, especially with decimals
numerical data and figures related to quantities or statistics
Negative Logits
Token      Logit
shopping   -0.65
succ       -0.63
direction  -0.60
fren       -0.60
puff       -0.59
subp       -0.58
tremend    -0.58
hog        -0.57
sworn      -0.57
psyche     -0.57
Positive Logits
Token  Logit
307    0.87
284    0.87
285    0.85
498    0.84
396    0.84
665    0.84
657    0.83
384    0.83
659    0.82
398    0.81
Activations Density: 0.158%