INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ']}
    -0.79
    Tikang
    -0.77
    "}")
    -0.77
    ')}
    -0.75
    ])]
    -0.70
    '];?>
    -0.68
    ')]
    -0.65
    ']]
    -0.64
    ');?>
    -0.64
    ")}
    -0.63
    POSITIVE LOGITS
     Monfieur
    0.81
     isolato
    0.70
     Altri
    0.69
     purpoſe
    0.69
     celib
    0.66
     Jefus
    0.65
     sacré
    0.65
     Aristote
    0.63
     elettrico
    0.63
     بيها
    0.62
    Act Density 0.010%

    No Known Activations