INDEX
Explanations
phrases related to being a part of something
references to components of a whole or contributions to a larger context
New Auto-Interp
Negative Logits
destro
-0.79
mare
-0.70
asu
-0.67
avorite
-0.66
ceilings
-0.65
bish
-0.64
metab
-0.63
incinn
-0.63
olesc
-0.63
usra
-0.62
POSITIVE LOGITS
ials
0.87
ially
0.85
aking
0.75
ners
0.75
uary
0.72
ioned
0.72
meal
0.71
adjunct
0.70
part
0.68
icularly
0.62
Activations Density 0.027%