INDEX
Explanations
information related to computer programming and technical issues
terms related to investigations and legal matters
New Auto-Interp
Negative Logits
sane
-0.67
thoughtful
-0.58
recognizes
-0.57
wise
-0.57
civilized
-0.57
careful
-0.56
ciplinary
-0.55
raged
-0.54
iless
-0.54
mindful
-0.53
POSITIVE LOGITS
.''.
1.06
.</
0.97
.''
0.95
'.
0.91
''.
0.91
$.
0.89
`.
0.89
.'
0.87
%.
0.86
".
0.84
Activations Density 1.310%