INDEX
Explanations
descriptive adjectives
repeated phrases or fragments in the text
New Auto-Interp
Negative Logits
EStream
-0.97
ulla
-0.87
regation
-0.86
phia
-0.84
ãĤ¨ãĥ«
-0.79
iatrics
-0.79
onica
-0.75
lication
-0.73
çīĪ
-0.72
apor
-0.70
POSITIVE LOGITS
informative
1.03
unpredictable
1.02
insightful
1.01
witty
0.97
inefficient
0.97
albeit
0.96
efficient
0.96
resilient
0.94
humorous
0.93
wasteful
0.92
Activations Density 0.209%