INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     échelle
    -0.40
     curiosidad
    -0.35
    wohl
    -0.35
     cera
    -0.35
    NameInMap
    -0.34
    Judging
    -0.34
     judging
    -0.33
     époque
    -0.32
     Judging
    -0.32
     murs
    -0.31
    POSITIVE LOGITS
    -->
    1.74
     -->
    1.68
    -->
    
    1.38
     --->
    1.34
     →
    1.34
     –>
    1.33
    --->
    1.30
    1.28
     -->
    
    1.25
     ->
    1.24
    Act Density 0.187%

    No Known Activations