INDEX
Explanations
text indicating laughter or amusement, specifically the abbreviation "lol" with varying levels of emphasis
expressions of laughter or amusement
Negative Logits
éĹ          -0.78
States      -0.77
icipated    -0.75
ENTS        -0.73
":["        -0.71
glim        -0.70
Topics      -0.69
arnaev      -0.67
Phant       -0.66
Catal       -0.65
Positive Logits
ipop        1.28
ita         1.05
cow         1.00
creen       0.87
cats        0.85
ipedia      0.84
ogie        0.84
ibrary      0.81
lol         0.78
oco         0.78
Activations Density 0.013%