INDEX
Explanations
advertisements within a text
occurrences of advertisements within the text
New Auto-Interp
Negative Logits
grass
-0.70
Kut
-0.67
Triangle
-0.65
Roses
-0.61
ãĥ¼ãĥĨãĤ£
-0.60
Ultr
-0.59
incarn
-0.58
stood
-0.58
masse
-0.57
Klu
-0.56
POSITIVE LOGITS
ADVERTISEMENT
1.05
Skip
1.01
iciary
0.80
Appears
0.78
Thanks
0.77
Continued
0.73
iseum
0.70
ILA
0.69
olson
0.69
VERTISEMENT
0.68
Activations Density 0.009%