INDEX
Explanations
adjectives describing intensity or completeness
words associated with completeness or fullness
New Auto-Interp
Negative Logits
apego
-0.77
encer
-0.72
apter
-0.71
ndra
-0.70
tis
-0.70
Flavoring
-0.69
vous
-0.68
Founders
-0.67
ourge
-0.67
ramid
-0.67
POSITIVE LOGITS
increments
1.06
guise
0.95
fashion
0.94
manner
0.91
circles
0.90
proportions
0.89
stead
0.88
quantities
0.87
terms
0.86
arenas
0.86
Activations Density 0.177%