INDEX
Explanations
mentions of characters and entities in a narrative context
robots and bots
New Auto-Interp
Negative Logits
Rüyada
-0.41
كومونز
-0.40
Италијани
-0.39
ViewFeatures
-0.39
AssemblyCulture
-0.37
省市镇
-0.36
windowFixed
-0.35
AnchorStyles
-0.35
película
-0.35
насељу
-0.34
POSITIVE LOGITS
robots
0.73
robot
0.71
chatbot
0.65
Robots
0.64
robotic
0.61
haikusbot
0.61
Robot
0.59
robo
0.58
robo
0.58
ChatGPT
0.57
Activations Density 0.176%