INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -2.38
    -1.00
    /**
    -0.78
    /***
    
    -0.74
     springfox
    -0.72
    ///**
    -0.70
     shivered
    -0.69
    
    
    -0.69
     shuddered
    -0.66
     quitted
    -0.66
    POSITIVE LOGITS
     Keny
    1.04
     Pasir
    0.86
     Karang
    0.86
     véhic
    0.85
     Momb
    0.85
     Khart
    0.85
     Ferdin
    0.81
     soulign
    0.80
     Tanjung
    0.80
     Batam
    0.80
    Act Density 0.168%

    No Known Activations