INDEX
Explanations
special characters and symbols followed by a colon
New Auto-Interp
Negative Logits
conversions
-0.77
constit
-0.75
conversion
-0.69
divest
-0.67
corpus
-0.66
distinguished
-0.65
seiz
-0.65
membership
-0.65
succession
-0.65
lifetime
-0.64
POSITIVE LOGITS
-)
1.24
lol
1.13
/)
1.12
_>
1.08
)"
1.08
laugh
1.08
DD
1.07
rolley
1.03
D
1.02
*=-
1.01
Activations Density 0.081%