INDEX
Explanations
adjectives and adverbs indicating intensity or degree
expressions indicating ongoing relevance or popularity
New Auto-Interp
Negative Logits
attery
-0.68
assi
-0.67
lez
-0.61
imov
-0.61
erity
-0.61
Mortgage
-0.61
Tesla
-0.60
Advice
-0.59
andum
-0.59
Interstitial
-0.59
POSITIVE LOGITS
virgin
0.76
intact
0.74
untouched
0.73
adolesc
0.73
scratching
0.71
unsolved
0.71
unanswered
0.70
birth
0.70
reeling
0.66
relevance
0.65
Activations Density 0.276%