INDEX
Explanations
positive comments or thoughts
statements of thought or opinion expressed about various subjects
New Auto-Interp
Negative Logits
*/(
-0.82
ESE
-0.82
APD
-0.76
aptic
-0.75
soever
-0.72
20439
-0.71
ICLE
-0.71
emphasis
-0.69
umbnails
-0.68
vale
-0.67
POSITIVE LOGITS
hilarious
1.07
funny
1.04
joking
1.04
nuts
1.04
crazy
1.03
gonna
1.03
cool
1.03
cute
1.02
invincible
0.98
alright
0.95
Activations Density 0.163%