INDEX
Explanations
numbers and values expressed in terms of percentage points
references to various percentage figures
Negative Logits
enegger      -0.62
accompanied  -0.62
afort        -0.62
oust         -0.62
facult       -0.62
messenger    -0.61
lavish       -0.60
efer         -0.60
Poster       -0.60
Anon         -0.59
Positive Logits
points       1.45
points       1.27
point        1.26
point        1.26
Points       1.05
pts          1.03
Points       1.03
Point        0.95
pointers     0.91
Celsius      0.90
Activations Density: 0.026%