INDEX
    NEGATIVE LOGITS
     aseguró    0.39
     namani    0.36
    reward    0.36
    udra    0.36
    odont    0.35
    става    0.35
    串口    0.35
    umela    0.35
    vila    0.35
     élabor    0.35
    POSITIVE LOGITS
     F    0.56
     FT    0.45
     ایف    0.41
     Fridays    0.40
     Fn    0.40
     friendliness    0.40
     Friday    0.40
     Friedrich    0.40
     FNA    0.39
     France    0.38
    Activation Density 0.159%

    No Known Activations