Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
cinci zoo sniper
Mar 15, 2013




basically yea it gets somewhat curated content, since followers mostly are ok yosposters, someone else, and goons_txt, so there is curated layer between polov and edgy subforums

Adbot
ADBOT LOVES YOU

cinci zoo sniper
Mar 15, 2013




i wonder if watson had same problem

cinci zoo sniper
Mar 15, 2013




flakeloaf posted:

it did, they had to tidy it up after someone fed it urban dictionary
:five:

cinci zoo sniper
Mar 15, 2013




https://twitter.com/markov_polov/status/719574639274360832
polov is really into his eMusic scam :stare:

e: really

cinci zoo sniper
Mar 15, 2013




eMusic
https://twitter.com/markov_polov/status/719749686936997888

cinci zoo sniper
Mar 15, 2013




https://twitter.com/ToriCMOS/status/721659264054505472

cinci zoo sniper
Mar 15, 2013




pumpy dumper is your script written for python 3?

e: whatever it is its broken all over the place for me - first it did dis the posters_name input, then both errors='replace' in get_post_content, now it got to unicode in my posts and died.

at least my posts are bad so nothing of value was lost

cinci zoo sniper fucked around with this message at 13:28 on Apr 17, 2016

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

yeah it's Python 3
yea that explains everything, im runnin 2.7 on this shitbox. will setup python 3 environment and run it again

cinci zoo sniper
Mar 15, 2013




https://twitter.com/ymcpos/status/721674376941318144

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

yeah it works for now. I'm working on rewriting it so it's clearer to read and execute
hm, it went through 850 posts and then just closed, is this intended behaviour?

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

are there any empty quotes or image link only posts? right now it just discards those and I forgot to add a print command to tell
im running another parse to make sure i didn't just stop it by accident, will upload it afterwards. there should be plentiful emptyquotes and link-only posts

cinci zoo sniper
Mar 15, 2013




Trig Discipline posted:

it must have hit a good post, they cause divide by zero errors
this time it found a good post earlier, 660th was the stop

cinci zoo sniper
Mar 15, 2013




stop 660 - http://pastebin.com/cwMKeNuQ
stop 850 - http://pastebin.com/FmMG1MhQ

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

So mine grabbed all of your posts. It says the script ended at 920 but when checking manually the last entry was the current last entry of you're post history. So I'm assuming there are posts in there that are empty quotes
yeah, as i said already, i emptyquote often, and same goes for posts with just links. weird that it would pulled two different amounts in two identical runs

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

how long was it between when you started and when it stopped?

The way the script is set up is it takes 5 mins to pull the links for all 1000 most recent posts.

Then it takes 1s per post to scrape. So in that time you could have posted and then when you ran the script again it moved posts in the recent list around
there were maybe 30 minutes inbetween

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

hmm weird. i'll have to run it on some other posters to see if i can reproduce.
59 minutes, i still have the files around. i was posting at a time, but nowhere near 190 emptyquotes

cinci zoo sniper
Mar 15, 2013




Westie posted:

bloody outsourcing

cinci zoo sniper
Mar 15, 2013




Pumpy Dumper posted:

the code is writing itself!!!
it passes markov test

cinci zoo sniper
Mar 15, 2013




:eyepop:

cinci zoo sniper
Mar 15, 2013




AWWNAW posted:

not sure if you're already doing this for seeding but you might be able to use NTLK to get some of the subject words from a post the bot is replying to. then again maybe bad idea because the bots would converge to Pittsburgh mono posting feedback loop
they would converge on *jerking motion*

cinci zoo sniper
Mar 15, 2013




NoneMoreNegative posted:

"heinously offensive ones (there was reddit in the corpus for variety but that sometimes yields lines i'd rather not make it to twitter)"

this ain't twitter, mang - make w/ the offence :madmax:
ehhhhh, no

Adbot
ADBOT LOVES YOU

cinci zoo sniper
Mar 15, 2013




i mean i'm fine with just offensive stuff, before you mentioned source there was no telling if it doesn't construct posts from arbitrary reddit lot where you can run in a whole load of awful garbage beyond "would you rather browse in amberpos or becomes allergic to beer"

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply