INDEX
Explanations
specific mentions of certain topics in the text related to video games, software development, and sports
New Auto-Interp
Negative Logits
luaj
-0.68
ngth
-0.64
isively
-0.63
resent
-0.57
riger
-0.56
rive
-0.56
bies
-0.55
istani
-0.54
ividual
-0.52
veget
-0.51
POSITIVE LOGITS
SPONSORED
0.94
blasphemy
0.76
why
0.74
understandable
0.73
happening
0.72
soType
0.69
untrue
0.68
reassuring
0.68
HO
0.68
unacceptable
0.67
Activations Density 1.042%