INDEX
Explanations
phrases indicating separation or distinction
New Auto-Interp
Negative Logits
dotenv
-0.82
orges
-0.68
SharedCtor
-0.67
type
-0.65
o
-0.64
styleType
-0.62
e
-0.62
volles
-0.61
type
-0.61
brahim
-0.59
POSITIVE LOGITS
apart
2.02
apart
1.86
Apart
1.67
APART
1.50
Apart
1.49
aside
1.43
Aside
1.32
aside
1.31
Aside
1.31
appart
1.13
Activations Density 0.077%