INDEX
Explanations
references to mathematical concepts like "square" with some variation in intensities
references to measurements or areas defined in square units
New Auto-Interp
Negative Logits
alcohol
-0.66
glers
-0.65
Cook
-0.65
ILA
-0.64
Recomm
-0.64
Daughter
-0.63
itiz
-0.62
started
-0.62
Acting
-0.62
Addiction
-0.62
POSITIVE LOGITS
square
4.08
square
2.88
squares
2.78
sq
2.36
Square
2.33
Square
2.14
squared
2.10
cubic
1.65
sq
1.60
rectangular
1.32
Activations Density 0.015%