INDEX
Explanations
conflicting concepts or opposing ideas in a text
themes related to conflict and contrasting desires for unity versus separation
New Auto-Interp
Negative Logits
Jaguar
-0.81
enthusi
-0.68
Antar
-0.67
Nar
-0.64
XD
-0.60
Gow
-0.59
EMS
-0.59
Wik
-0.58
etc
-0.58
Kro
-0.57
POSITIVE LOGITS
roying
0.74
èĥ
0.73
readiness
0.65
ield
0.64
isl
0.63
corresponding
0.62
actual
0.60
lifting
0.60
fulfilling
0.59
idy
0.58
Activations Density 0.724%