Explanations
references to hippies or anything related to hippie culture
references to hippies or the hippie subculture
Negative Logits
âĵĺ                                 -0.80
rawdownloadcloneembedreportprint    -0.74
BIP                                 -0.73
ERROR                               -0.70
âϦ                                  -0.69
AUT                                 -0.69
defamation                          -0.68
Charges                             -0.68
DoS                                 -0.67
Editor                              -0.66
Positive Logits
opot        1.27
hipp        1.20
ocamp       1.12
ocrates     1.01
ocratic     0.97
olit        0.95
ocrat       0.90
eties       0.90
ocr         0.87
romeda      0.87
Activations Density 0.010%