INDEX
Explanations
the concept of absence or non-existence
instances where the word "none" is emphasized
New Auto-Interp
Negative Logits
illon
-0.76
roc
-0.68
rod
-0.68
orate
-0.67
orf
-0.67
oral
-0.65
atro
-0.64
odium
-0.64
urry
-0.64
bledon
-0.62
POSITIVE LOGITS
theless
0.83
uther
0.77
etheless
0.75
endif
0.75
none
0.74
thereof
0.74
else
0.71
igham
0.67
ONSORED
0.66
none
0.65
Activations Density 0.006%