|
Pie Colony posted:if the company is well-known or has room to be picky with candidates, chances are they don't really just want a "sum these up, sort and return the first 10" solution. they are probably looking for candidates that know to use a heap to do the sorting in n log k, and probably someone that does all the counting in parallel. worry about that poo poo at the actual interview
|
# ? Sep 15, 2014 22:18 |
|
|
# ? Jun 5, 2024 05:38 |
|
syntaxrigger posted:So an ML shop contacted me because I have java in my resume and I have lofty goals of one day not sucking at ML. They sent me a programming test that I did horrible in because I am not strong enough in when to use what data structures and algorithims. So instead of wallowing coffetable convinced me to post here for some insight into what I should have done. load it into sqlite or something, virtually all of those can be done in a single query each. sqlite3 shell has a tsv loading command i think
|
# ? Sep 15, 2014 22:49 |
|
coffeetable posted:why're you intimidated by that ML syllabus you sound like my mum when she pushed me to do ib standard maths instead of studies because "you don't want to close doors!!!!" even tho i didn't meet the prereqs it was an entirely predictable catastrofuck and it ended up forcing me to do some godawful hybrid ap/ib thing for my last year of hs
|
# ? Sep 16, 2014 11:31 |
|
math is not easy for anyone
|
# ? Sep 16, 2014 18:16 |
|
what's ib
|
# ? Sep 16, 2014 18:32 |
|
international baccalaureate, it's like the ap but for europe
|
# ? Sep 16, 2014 18:36 |
|
lol if you do serious ml work in languages that are slow, aka not in c++ or fortran
|
# ? Sep 16, 2014 19:00 |
|
lol if you don't just call out to c++ or fortran for anything that actually needs to be fast
|
# ? Sep 16, 2014 19:01 |
|
coffeetable posted:lol if you don't just call out to c++ or fortran for anything that actually needs to be fast
|
# ? Sep 16, 2014 19:02 |
|
assume a fitness function f that maps a languages ability to: fart around in a file system janitor strings call out w/ c calling convention solve for the global maximum
|
# ? Sep 16, 2014 19:06 |
|
my current solution is c# but i think its just a local maximum
|
# ? Sep 16, 2014 19:06 |
|
im p impressed w python so far. the language itself is garbage but scikit-learn and pymc are both really good found a cluster in some medical trial data today that another company had completely missed. no idea if it means anything yet - we're blinded - but gently caress yeah variational methods e: it was the size of a house as well. from what i can tell they just banged it through PCA and some hierarchical clustering heuristic and went "welp nuttin there". half of it looked like it was done in excel too coffeetable fucked around with this message at 19:20 on Sep 16, 2014 |
# ? Sep 16, 2014 19:16 |
|
"data science" is a dogwhistle for pca
|
# ? Sep 16, 2014 19:24 |
|
coffeetable posted:im p impressed w python so far. the language itself is garbage but scikit-learn and pymc are both really good how did you find it
|
# ? Sep 16, 2014 19:52 |
|
FamDav posted:how did you find it http://scikit-learn.org/stable/modules/dp-derivation.html
|
# ? Sep 16, 2014 20:00 |
|
ok seriously what does it take for my eyes to not immediately gloss over at the sight of unfamiliar greek letters where the gently caress did i go so wrong
|
# ? Sep 16, 2014 20:02 |
|
thatd be 10000x more readable to me if instead of they actually wrote the poo poo out lmao transparency
|
# ? Sep 16, 2014 20:03 |
|
Bloody posted:thatd be 10000x more readable to me if instead of theres only 24 greek letters, plus a few capitals, phi mu and sigma are pretty common and hey they did write out Gamma instead of using an actual Gamma
|
# ? Sep 16, 2014 20:10 |
|
fritz posted:theres only 24 greek letters, plus a few capitals, phi mu and sigma are pretty common looking further they use nu, which is a real nu-sance
|
# ? Sep 16, 2014 20:14 |
|
24 symbols my brain processes in idiotic ways a, b, y (uppercase is just a travesty tho), triangle, e, squiggle z, n, theta (lol if you dont theta), i, k, eigenthing, micro, v, jesus christ what is this letter, o, pi, stupid p, capital sigma obvi but those other two just lol (jiffy lube) (why the gently caress are there two), t, u, hosed up pitchfork, x, pitchfork, ohms going off the first table in https://en.wikipedia.org/wiki/Greek_alphabet
|
# ? Sep 16, 2014 20:19 |
|
idk id just always rather read pseudocode than maths because my background is stronger in codethings than mathsthings
|
# ? Sep 16, 2014 20:23 |
|
Bloody posted:24 symbols my brain processes in idiotic ways zeta is a girl with a ponytail on a swing, xi is the same with arms sticking out. there are two small sigmas because you use a different one at the end of a word!
|
# ? Sep 16, 2014 20:34 |
|
i don't know how people program stuff without intellisense and linq, i can barely program with them
|
# ? Sep 16, 2014 20:37 |
|
Bloody posted:thatd be 10000x more readable to me if instead of for the record though, here it is in english: each sample (X_i) is drawn from a normal distribution. which set of parameters are used for that normal distribution is decided by the indicator variable for that sample, z_i. that indicator is in turn drawn from a Dirichlet process* whose hyperparameters are drawn from a beta distribution. alternatively: the Dirichlet process models the assignment of samples to clusters, the normal-gamma distributions model the parameters for each cluster, and the normal distribution models the samples within each cluster *aka stick-breaking process, which is why it's denoted SBP
|
# ? Sep 17, 2014 07:17 |
|
they are just pictures, its just like learning chinese
|
# ? Sep 17, 2014 07:36 |
|
pointsofdata posted:i don't know how people program stuff without intellisense and linq, i can barely program with them me @ work : *hunt and peck 2 or 3 letters* *mash ctrl+space* then i take 5
|
# ? Sep 17, 2014 12:10 |
|
Bloody posted:idk id just always rather read pseudocode than maths because my background is stronger in codethings than mathsthings yeah, i browsed some math-y books on coworkers desks and there's the assumption that you know what all these symbols mean or imply already without even defining other stuff or what it's supposed to do
|
# ? Sep 17, 2014 16:44 |
|
Share Bear posted:yeah, i browsed some math-y books on coworkers desks and there's the assumption that you know what all these symbols mean or imply already without even defining other stuff or what it's supposed to do some books have a table with their particular notations in the introduction
|
# ? Sep 17, 2014 17:26 |
|
fritz posted:some books have a table with their particular notations in the introduction or on the inside of the back cover
|
# ? Sep 17, 2014 17:30 |
|
hello, first, thanks shaggar for the advice earlier. i got a project up and running with all the nice stuff maven and mybatis provide. I have a doubt, is using a singleton for my mybatis SqlSessionFactory appropriate? here is the singleton btw: Java code:
|
# ? Sep 17, 2014 23:04 |
|
eh, that's one way to do it I guess. I prefer to configure the datasource in my spring config and let mybatis-spring (another lib) generate proxies for my mapping interfaces. then I just inject those into where I use them. so if I have an interface like Java code:
Obviously you can implement the SomeDataMapper interface yourself with a singleton to generate sqlsessions that you use inside the implementation of the getThing method. But its gonna be dumb boilerplate, so mybatis-spring comes to the rescue. w/ mybatis-spring you add a line that loads the spring mybatis mapper thingy (idr the real name) and it hunts down ur mapper xmls and matches them up with your mapper interfaces and builds proxy objects in the bean container with the same name as your interface. so in our example this would be a bean named someDataMapper. then wherever you need to use that interface you inject the proxy bean into your code from spring. Idk if that helps cause idk how much spring you know. Just sticking w/ the singleton is probably fine, but learning spring has a load of advantages once you get it.
|
# ? Sep 18, 2014 02:07 |
|
Forums Terrorist posted:international baccalaureate, it's like the ap but for europe and helicopters families in the us
|
# ? Sep 18, 2014 02:20 |
|
Serious question, what is a factory and how is it different from a constructor
|
# ? Sep 19, 2014 10:12 |
|
MeruFM posted:Serious question, what is a factory and how is it different from a constructor it's different from a constructor because the object it returns may not always be the same class and may not always be a new instance.
|
# ? Sep 19, 2014 10:35 |
|
MeruFM posted:Serious question, what is a factory and how is it different from a constructor a factory is good for returning different versions of an object depending on state. like let's assume insurance is a godawful cluster gently caress, i know farfetched. there's hundreds of different companies and thousands of different policies and you're trying to get the billing on a root canal and capping so that the dentist can buy his trophy wife a make-up bag. billings are done completely differently by each company and are further differentiated by each policy. luckily each person has their policy id and insurance company in the record so you just call GitMoneyFactory.dollaDollaBillzYo() with your person object and it returns the corresponding GitMoney class capable of doing all that poo poo you need to do for FYGM insurance company that the patient belongs to. if you had used a constructor instead youd need a switch with thousands of cases.
|
# ? Sep 19, 2014 19:16 |
|
ii oh el posted:a factory is good for returning different versions of an object depending on state. You missed a big part, the GitMoneyFactory is the one that actually contains the switch statement with thousands of cases. The point is that the terrible clusterfuck of logic is encapsulated so that you don't have to copy and paste it everywhere in your code that wants to just get paid.
|
# ? Sep 19, 2014 20:33 |
|
Janitor Prime posted:You missed a big part, the GitMoneyFactory is the one that actually contains the switch statement with thousands of cases. The point is that the terrible clusterfuck of logic is encapsulated so that you don't have to copy and paste it everywhere in your code that wants to just get paid. you're right i should've explained that. thanks for pointing it out. i sometimes think that i imply connections clearly but on reflection haven't at all.
|
# ? Sep 19, 2014 20:49 |
|
so ive got a question. I am making something that revolves around people comitting code to github. problem is i know most people wont commit and push frequently enough for my site to be..accurate. so tell me how people SHOULD be using version control, no smart rear end answers please!!! should it be commit often and push often, donrt let too many commits build up locally before pushing them?
|
# ? Sep 19, 2014 22:35 |
|
It's not just about the number of commits, but the stability of the feature your working on and the impact it has on other people who will pull your changes in a half finished state. This is why branching is important, so that they don't fuckup the main line of development, but are free to push their changes whenever they want and in your case you can feel better about telling them to push often so that your tracking thing shows up to date information.
|
# ? Sep 19, 2014 22:40 |
|
|
# ? Jun 5, 2024 05:38 |
|
Vincent K. McMahon posted:so ive got a question. I am making something that revolves around people comitting code to github. problem is i know most people wont commit and push frequently enough for my site to be..accurate. you dont push 'commits' per se you push a completed item (where item is a bug fix or a feature or whatever) They should be working in some local branch on their crap then when its ready to go merge/rebase/whatever it into master and push from there once for the whole item.
|
# ? Sep 19, 2014 22:45 |