INDEX
Explanations
repeated occurrences of the word "got."
Negative Logits
felizes           -0.70
betweenstory      -0.64
sarili            -0.56
astă              -0.56
tences            -0.56
wikimedia         -0.56
carottes          -0.56
Freien            -0.55
doInBackground    -0.55
TestBed           -0.55
Positive Logits
got               0.90
got               0.85
Got               0.84
Got               0.82
gotta             0.80
possess           0.72
GOT               0.67
GOT               0.67
Have              0.66
Gotcha            0.63
Activations Density: 0.116%
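
As a rough illustration of what an activation density figure like the one above could mean, the sketch below computes the fraction of token positions on which a feature fires. The definition (activation strictly above a threshold of 0.0), the function name `activation_density`, and the sample data are all assumptions made for illustration; the page itself does not state how the figure is computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 12 of which activate the feature.
sample = [0.0] * 9988 + [1.5] * 12
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumed definition, a density of 0.116% would correspond to the feature firing on roughly 1 in every 860 tokens.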