INDEX
Explanations
adjectives or nouns related to depth or intensity
phrases that convey depth or intensity of emotion or experience
Negative Logits
oppable    -0.77
orious     -0.71
icans      -0.69
ULE        -0.67
roma       -0.66
annon      -0.66
uthor      -0.66
Ĥİ         -0.65
EED        -0.65
Frames     -0.64
Positive Logits
vein           1.05
ened           1.01
pockets        0.95
seeded         0.92
penetration    0.88
breaths        0.87
ening          0.85
dive           0.84
rooted         0.82
deep           0.81
Activations Density 0.027%
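
A density figure like the one above is usually read as the fraction of sampled tokens on which the feature activates at all. The sketch below illustrates that reading only; the function name, the zero threshold, and the sample data are hypothetical and are not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means (number of active tokens) / (total tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, with the feature firing on 3 of them.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.030%
```

Under this reading, a density of 0.027% would correspond to roughly 27 active tokens per 100,000 sampled.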