INDEX
Explanations
references to dimensions or measurements
New Auto-Interp
Negative Logits
set
-0.18
land
-0.17
Highest
-0.16
ship
-0.16
lie
-0.16
like
-0.16
ë°©
-0.15
self
-0.15
_intf
-0.15
haven
-0.15
POSITIVE LOGITS
able
0.27
/type
0.23
ToFit
0.23
़ा
0.18
/color
0.18
hetto
0.17
/style
0.17
/types
0.16
abwe
0.16
hint
0.16
Activations Density 0.036%