INDEX
Explanations
expressions of gratitude and acknowledgment in statements
New Auto-Interp
Negative Logits
>()
-0.67
Xna
-0.66
});*/
-0.65
*/}
-0.64
})`
-0.61
}`;
-0.61
$^{-0.60
'/>
-0.59
*/;
-0.57
}`);
-0.57
POSITIVE LOGITS
@
0.98
@
0.77
#
0.75
.@
0.75
(@
0.74
"@
0.70
/@
0.69
0.68
sic
0.68
f
0.67
Activations Density 0.083%