INDEX
Explanations
characters and elements from fictional narratives and scientific contexts, particularly focusing on themes of transformation and identity
New Auto-Interp
Negative Logits
kasarigan
-0.80
UnusedPrivate
-0.70
principalTable
-0.64
Kariera
-0.64
LikeLiked
-0.60
arşivlendi
-0.57
abancı
-0.56
saites
-0.56
τρό
-0.55
برانيه
-0.55
POSITIVE LOGITS
HAM
1.17
SPM
1.15
BEM
1.14
CSM
1.13
DPM
1.13
HAM
1.12
LAM
1.12
JAM
1.10
ACM
1.09
Dm
1.09
Activations Density 6.004%