Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
Bloody
Mar 3, 2013

MononcQc posted:

hi everyone sorry i dont post here as often as i should here is a pic of me browsing yospos



0/10 monitor not displaying yospos color theme not cga

Adbot
ADBOT LOVES YOU

MononcQc
May 29, 2007

im using elinks you scrub.


also here's me before going to bed ;)

Blotto Skorzany
Nov 7, 2008

He's a PSoC, loose and runnin'
came the whisper from each lip
And he's here to do some business with
the bad ADC on his chip
bad ADC on his chiiiiip
find a way to shoehorn these into thisotplife imo

Posting Principle
Dec 10, 2011

by Ralp

MononcQc posted:

im using elinks you scrub.


also here's me before going to bed ;)



wow so erlang

MononcQc
May 29, 2007

Started reading on gossip protocols and epidemic stuff. This paper is a decent intro.

rotor
Jun 11, 2001

classic case of pineapple on pizzadog derangement syndrome
read a few pages back someone was talkin poo poo about awk and my feeling is that they can go gently caress themselves with a forked stick

FamDav
Mar 29, 2008
rotor are you my dad

rotor
Jun 11, 2001

classic case of pineapple on pizzadog derangement syndrome

FamDav posted:

rotor are you my dad

i've done a lot of things in my life i'm not proud of

Gazpacho
Jun 18, 2004

by Fluffdaddy
Slippery Tilde

rotor posted:

i've done a lot of things in my life i'm not proud of

Blotto Skorzany
Nov 7, 2008

He's a PSoC, loose and runnin'
came the whisper from each lip
And he's here to do some business with
the bad ADC on his chip
bad ADC on his chiiiiip
:guinness:

Lysidas
Jul 26, 2002

John Diefenbaker is a madman who thinks he's John Diefenbaker.
Pillbug
awk has its uses, like most good tools, sometimes a busybox shell is all you have like in initrd images

Zombywuf
Mar 29, 2008

I wrote an awk once to take the output of the pep8 tool and turn it into an awk script that fixed the pep8 errors in the original file. This was the easiest way to make tef's code readable.

Zombywuf
Mar 29, 2008

Cool as gently caress: http://spritesmods.com/?art=hddhack&page=1

Notorious b.s.d.
Jan 25, 2003

by Reene

MononcQc posted:

Whatever, I did find Awk pretty nice for the way it works. It's rather straight to the point, fast enough, and standard in most places so it's nice to put a specific short script together to gather data. Anything bigger we send to splunk though.

i was talkin to a dude who does consulting on hbase and hadoop poo poo
apparently every single one of his customers is doing request log parsing

thousand node compute grids to read httpd logs no joke

this may be how splunk made themselves a billion dollar company idk

tef
May 30, 2004

-> some l-system crap ->

MononcQc posted:

Started reading on gossip protocols and epidemic stuff. This paper is a decent intro.


noice

Jonny 290
May 5, 2005



[ASK] me about OS/2 Warp

Notorious b.s.d. posted:

i was talkin to a dude who does consulting on hbase and hadoop poo poo
apparently every single one of his customers is doing request log parsing

thousand node compute grids to read httpd logs no joke

this may be how splunk made themselves a billion dollar company idk

seriously? jesus i can eat logs and poo poo excels and csvs all day. thread that poo poo, too. TURBO STYLE. how many cores u got, bitch

Zombywuf
Mar 29, 2008

Notorious b.s.d. posted:

i was talkin to a dude who does consulting on hbase and hadoop poo poo
apparently every single one of his customers is doing request log parsing

thousand node compute grids to read httpd logs no joke

this may be how splunk made themselves a billion dollar company idk

That and Splunk charge in scales of porsches ($100,0000) and houses ($250,000). Because no-one else can scale log parsing like they can. Their secret? Custom awk, gzipped plain text log files and a few scraps of Python.

Where I currently work we do with 1 db what major academic research projects do with Hadoop, I can't remember how many nodes but it's a lot more than 1.

X-BUM-RAIDER-X
May 7, 2008
hi I got a promotion today so I get to keep on doing the exact same poo poo except feel more important somehow

Max Facetime
Apr 18, 2009

interactive visualization of cpu and memory speeds

X-BUM-RAIDER-X
May 7, 2008
funny how easy it is to move up in programming while doing next to nothing

prefect
Sep 11, 2001

No one, Woodhouse.
No one.




Dead Man’s Band

OBAMA BIN LinkedIn posted:

hi I got a promotion today so I get to keep on doing the exact same poo poo except feel more important somehow

congratulations! :thumbsup:

Notorious b.s.d.
Jan 25, 2003

by Reene

Zombywuf posted:

That and Splunk charge in scales of porsches ($100,0000) and houses ($250,000). Because no-one else can scale log parsing like they can. Their secret? Custom awk, gzipped plain text log files and a few scraps of Python.

Where I currently work we do with 1 db what major academic research projects do with Hadoop, I can't remember how many nodes but it's a lot more than 1.

well the parsing is the easy part, especially if you do most of it on the clients/agents

what is their magic for the full text index? that poo poo is wicked fast

Notorious b.s.d.
Jan 25, 2003

by Reene

Jonny 290 posted:

seriously? jesus i can eat logs and poo poo excels and csvs all day. thread that poo poo, too. TURBO STYLE. how many cores u got, bitch

believe it or not a lot of hadoop users aren't doing anything with the cool java-based hadoop/cascading APIs, just using "streaming"

hadoop "streaming" is when your hadoop job is python/perl scripts that talk on stdin/stdout. world's most complicated job control for unix pipes

Jonny 290
May 5, 2005



[ASK] me about OS/2 Warp

Notorious b.s.d. posted:

believe it or not a lot of hadoop users aren't doing anything with the cool java-based hadoop/cascading APIs, just using "streaming"

hadoop "streaming" is when your hadoop job is python/perl scripts that talk on stdin/stdout. world's most complicated job control for unix pipes

hahaha for fucks sake

i need to get to denver and start slutting it up

Malcolm XML
Aug 8, 2009

I always knew it would end like this.

This is cool

it needs cache line support, and Disk/SSD/Network visualizations (with time compression ofc)

Zombywuf
Mar 29, 2008

Notorious b.s.d. posted:

what is their magic for the full text index? that poo poo is wicked fast

gzip and grep (and a healthy disk cache).

Seriously.

Their indexes are just gzipped text files. There's no magic, think of it as a column store with one column and good compression.

Think about how many instructions you can execute in the time it takes to read a single page from a disk and you'll see why gzip is a good solution.

MononcQc
May 29, 2007

Latest Awk program helped diagnose why a node crashed by identifying a concurrency bottleneck from a crash dump :3: Awk owns.


E: gaddamn I gotta find a new avatar

MononcQc fucked around with this message at 15:23 on Aug 7, 2013

abraham linksys
Sep 6, 2010

:darksouls:
LiveScript is a fork of Coco, which is itself a fork of CoffeeScript

:shepface:

Jonny 290
May 5, 2005



[ASK] me about OS/2 Warp

abraham linksys posted:

LiveScript is a fork of Coco, which is itself a fork of CoffeeScript

:shepface:

lol ppl just making poo poo up now

abraham linksys
Sep 6, 2010

:darksouls:

Jonny 290 posted:

lol ppl just making poo poo up now

now?

Bloody
Mar 3, 2013

MononcQc posted:

Latest Awk program helped diagnose why a node crashed by identifying a concurrency bottleneck from a crash dump :3: Awk owns.


E: gaddamn I gotta find a new avatar
uve probably missed the cgatar bandwagontrain at this point but somebody might be able to bail you out

in fact i may go cga that mugshot of you like right now

eh nevermind its coming out too lovely

Bloody fucked around with this message at 15:55 on Aug 7, 2013

tef
May 30, 2004

-> some l-system crap ->
yosposting from inside moz london

Vanadium
Jan 8, 2005

do they have red pandas there too?

prefect
Sep 11, 2001

No one, Woodhouse.
No one.




Dead Man’s Band

tef posted:

yosposting from inside moz london

how many people there speak with cockney accents?

Zombywuf
Mar 29, 2008

prefect posted:

how many people there speak with cockney accents?

None of them will speak with cockerney accents, I can tell you that.

Zaxxon
Feb 14, 2004

Wir Tanzen Mekanik

tef posted:

yosposting from inside moz london

But in the U.K. moz means Morrissey.

X-BUM-RAIDER-X
May 7, 2008

prefect posted:

how many people there speak with cockney accents?

they all actually speak like this https://www.youtube.com/watch?v=HSPwqV8CNG0

PENETRATION TESTS
Dec 26, 2011

built upon dope and vice

Notorious b.s.d. posted:

i was talkin to a dude who does consulting on hbase and hadoop poo poo
apparently every single one of his customers is doing request log parsing

thousand node compute grids to read httpd logs no joke

this may be how splunk made themselves a billion dollar company idk

as far as i can tell the companies with thousand-plus node clusters are all doing slightly more than parsing, more like clickstream analysis, with parsing and preprocessing happening upstream

but there is a whole lot of needless or inefficient use of hadoop, probably because the teams that build and run the infrastructure have a vested interest in expanded use and increased scale

and because it's really fuckin easy to use hadoop streaming

prefect
Sep 11, 2001

No one, Woodhouse.
No one.




Dead Man’s Band

PENETRATION TESTS posted:

as far as i can tell the companies with thousand-plus node clusters are all doing slightly more than parsing, more like clickstream analysis, with parsing and preprocessing happening upstream

but there is a whole lot of needless or inefficient use of hadoop, probably because the teams that build and run the infrastructure have a vested interest in expanded use and increased scale

and because it's really fuckin easy to use hadoop streaming

the last thing is what i'd put money on. anything that's "really fuckin easy" is hard to argue against :)

Adbot
ADBOT LOVES YOU

PENETRATION TESTS
Dec 26, 2011

built upon dope and vice
at least in the org i work with the most egregious offender is Hive, people do huge fuckin queries that spin up tens of thousands of mappers to do simple SQL-like queries... they end up taking ten minutes or so, so gently caress it

stuff like give me this entry in the user table joined to her entries in this other table -> read in the entirety of both tables, throw out all but a few rows, join them on one reducer while the other 99 get no input

they're all one-off queries so they aren't even much of a burden but they're just so offensively inefficient given that all the same data is in a big expensive and efficient relational database

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply