INDEX
Explanations
names of individuals
phrases indicating missed opportunities or important observations
New Auto-Interp
Negative Logits
ternity
-0.60
rontal
-0.59
mercial
-0.58
zar
-0.57
pend
-0.57
artisan
-0.56
imsy
-0.55
mop
-0.54
Tot
-0.54
yes
-0.54
POSITIVE LOGITS
?:
0.93
!?
0.71
]:
0.70
entails
0.68
nutshell
0.68
?
0.67
%:
0.66
Story
0.66
\":
0.66
boils
0.66
Activations Density 0.340%