INDEX
Explanations
phrases related to the capabilities or potential of entities or products
phrases indicating capabilities and the impact of actions or events
New Auto-Interp
Negative Logits
estern
-0.68
onet
-0.61
Vil
-0.59
noticed
-0.58
Hills
-0.56
surn
-0.56
Diesel
-0.56
urinary
-0.56
uclear
-0.55
briefs
-0.55
POSITIVE LOGITS
entail
0.99
pires
0.84
entails
0.81
ĸļ
0.69
SourceFile
0.68
.(
0.66
incial
0.65
ablishment
0.65
purported
0.65
consist
0.65
Activations Density 0.203%