INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Gottfried
    -0.53
     Blitz
    -0.48
    yente
    -0.47
    }),
    
    -0.47
    üf
    -0.46
    adpleegd
    -0.46
    desta
    -0.46
    yto
    -0.45
    -------
    -0.45
    richt
    -0.45
    POSITIVE LOGITS
    Car
    1.48
     car
    1.47
     Car
    1.41
    car
    1.33
     cars
    1.27
     Cars
    1.27
    Cars
    1.22
    cars
    1.21
    CAR
    1.08
     CARS
    1.02
    Act Density 0.143%

    No Known Activations