INDEX
Explanations
concepts related to social significance and cultural analysis
Academic or political discourse
academic discourse and concepts
Negative Logits
+#+                 -0.77
iſen                -0.69
محفوظة              -0.67
propOrder           -0.65
ésultats            -0.65
$_"                 -0.64
pleaſure            -0.64
ConstraintMaker     -0.64
edicated            -0.62
ſchen               -0.60
Positive Logits
Gegenstand          0.35
uitgenodigd         0.33
political           0.32
partial             0.32
探讨                0.31
(                   0.31
provocative         0.31
立场                0.31
gegenwär            0.31
cited               0.31
Activations Density: 0.771%