INDEX
    Explanations

    content related to scientific measurements or analysis

    after specific nouns

    New Auto-Interp
    Negative Logits
     للاسماء
    -0.45
    rosis
    -0.45
     &___
    -0.44
     Signalez
    -0.43
     상세
    -0.43
     تكبرها
    -0.43
    🟤
    -0.43
    ươi
    -0.42
     corações
    -0.42
     <>",
    -0.42
    POSITIVE LOGITS
     beiden
    0.98
    どちらも
    0.98
     both
    0.96
     ambos
    0.93
     Both
    0.92
    Both
    0.90
    both
    0.90
     entrambi
    0.89
     beide
    0.86
     begge
    0.85
    Act Density 1.018%

    No Known Activations