INDEX
Explanations
past tense verbs ending in 'ed'
expressions of past experiences and actions
New Auto-Interp
Negative Logits
etheless
-0.77
patch
-0.64
MacArthur
-0.62
device
-0.62
conservancy
-0.61
robat
-0.61
idon
-0.61
align
-0.60
atin
-0.60
Dru
-0.60
POSITIVE LOGITS
marginally
0.80
ONE
0.79
spor
0.78
ifiable
0.70
scratched
0.66
fraction
0.65
hig
0.65
temporary
0.65
ered
0.63
anke
0.63
Activations Density 0.164%