INDEX
Explanations
detailed explanations or descriptions within text
the term "description" and its variations
Negative Logits
nuts      -0.79
ced       -0.69
yrus      -0.68
abb       -0.67
cot       -0.66
enthal    -0.66
acus      -0.66
inth      -0.65
ergic     -0.64
lah       -0.62
Positive Logits
descriptions      1.04
description       1.03
REDACTED          0.86
description       0.83
synopsis          0.83
describ           0.82
anguage           0.82
thereof           0.77
specifications    0.77
Description       0.75
Activations Density 0.014%
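
As a rough illustration of what a density figure like this can mean, the sketch below assumes density is the percentage of sampled tokens on which the feature's activation exceeds a threshold. The function name, the threshold, and the sample data are all hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: density = (active tokens / total tokens) * 100.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 14 of 100,000 tokens.
sample = [1.0] * 14 + [0.0] * 99_986
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this assumption, a density of 0.014% corresponds to the feature firing on roughly 14 of every 100,000 tokens.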