INDEX
Explanations
self-referential phrases and expressions of sentiment
New Auto-Interp
Negative Logits
MathML
-0.48
Supported
-0.32
supported
-0.32
displaystyle
-0.30
spoken
-0.28
HomeAsUpEnabled
-0.28
Received
-0.28
witnessed
-0.28
bore
-0.27
Supported
-0.26
POSITIVE LOGITS
added
0.99
Added
0.95
added
0.90
modified
0.88
removed
0.87
Added
0.86
selected
0.85
ADDED
0.84
tweaked
0.83
removed
0.82
Activations Density 0.731%