INDEX
Explanations
verbs or nouns denoting significant or impactful actions or concepts
musical or thematic elements and structures within narratives
New Auto-Interp
Negative Logits
BSD
-0.73
phia
-0.72
Charge
-0.70
Masters
-0.69
ADS
-0.69
bilt
-0.65
attraction
-0.65
pire
-0.65
ADA
-0.65
dash
-0.64
POSITIVE LOGITS
oring
1.05
ored
1.00
arding
0.96
oured
0.93
ouring
0.88
roup
0.86
uilding
0.84
orer
0.83
orers
0.82
oning
0.81
Activations Density 0.026%