INDEX
Explanations
mentions of various forms of the word "press."
New Auto-Interp
Negative Logits
fromnode
-0.73
-0.73
sherds
-0.72
wattpad
-0.67
intStringLen
-0.66
OnTop
-0.65
hulls
-0.64
iprot
-0.64
iprot
-0.64
welds
-0.63
POSITIVE LOGITS
press
0.94
pressing
0.84
Press
0.81
pressing
0.80
Water
0.76
water
0.74
Water
0.73
press
0.72
PRESS
0.67
water
0.66
Activations Density 0.173%