INDEX
Explanations
abbreviations or acronyms ending in "DF"
references to specific defense forces or emergency services
New Auto-Interp
Negative Logits
stakes
-0.64
felt
-0.64
car
-0.63
Metatron
-0.63
ographed
-0.62
vo
-0.62
McCartney
-0.62
sole
-0.60
cort
-0.60
parents
-0.60
POSITIVE LOGITS
DF
1.20
amily
0.96
WD
0.96
DM
0.93
avorite
0.92
sg
0.88
rost
0.86
RAG
0.85
RF
0.84
arlane
0.82
Activations Density 0.007%