INDEX
Explanations
phrases related to historical events and scholarly discourse
references to historical events or figures
New Auto-Interp
Negative Logits
surgeries
-0.60
perimeter
-0.60
isSpecialOrderable
-0.56
goodwill
-0.52
FILE
-0.50
patient
-0.50
broom
-0.50
SHARE
-0.48
wives
-0.48
stret
-0.48
POSITIVE LOGITS
aptly
0.75
quoting
0.72
"â̦
0.63
""
0.61
"[
0.58
"...
0.58
sarcast
0.56
Rue
0.56
distribut
0.55
vertisement
0.54
Activations Density 0.715%