INDEX
Explanations
specific measurements related to physical attributes or quantities
quantitative descriptions and statistics
New Auto-Interp
Negative Logits
apego
-0.63
oaded
-0.54
orsi
-0.53
inx
-0.51
ient
-0.51
oppable
-0.51
enegger
-0.50
uilt
-0.50
milo
-0.50
rehensive
-0.50
POSITIVE LOGITS
guise
0.83
context
0.82
vicinity
0.79
nutshell
0.75
manner
0.75
mode
0.73
circles
0.72
midst
0.69
contexts
0.67
vein
0.67
Activations Density 1.035%