INDEX
Explanations
phrases that describe actions or sentiments characterized as significant or shocking
New Auto-Interp
Negative Logits
MarshalTo
-0.55
:+:
-0.48
WSGI
-0.46
IVersion
-0.41
بوابة
-0.41
InstanceState
-0.41
jsPsych
-0.40
pant
-0.40
obox
-0.40
spender
-0.40
POSITIVE LOGITS
"..
0.69
"...
0.64
“…
0.63
“...
0.63
]="
0.61
Describes
0.60
“(
0.59
“[
0.59
LEncoder
0.59
"[
0.59
Activations Density 0.404%