INDEX
Explanations
special characters or symbols, particularly slashes followed by numbers
occurrences of slashes
Negative Logits
square      -0.84
breed       -0.83
rebuilt     -0.79
swell       -0.78
densely     -0.77
lifetime    -0.76
geared      -0.75
enclosed    -0.75
scratch     -0.75
squid       -0.74
Positive Logits
whatever    1.72
etc         1.69
trans       1.43
deb         1.40
anti        1.39
coll        1.38
acqu        1.38
non         1.38
dist        1.38
super       1.37
Activations Density: 0.059%