INDEX
Explanations
adjectives describing specific aspects of something
references to different facets or attributes of a subject
New Auto-Interp
Negative Logits
ander
-0.82
anders
-0.72
usable
-0.72
ettings
-0.72
hov
-0.72
andering
-0.71
ricanes
-0.70
annis
-0.68
gencies
-0.68
inflamm
-0.62
POSITIVE LOGITS
SourceFile
0.91
uality
0.86
aspects
0.82
stones
0.81
facets
0.80
iveness
0.76
lihood
0.74
ality
0.73
stone
0.73
rait
0.71
Activations Density 0.010%