get ready for the 20th Knowledgefeed of the VDSG!
After a little bit extended summer hiatus we continue our Knowledgefeed series offering you interesting talks on various essential topics in the data science domain.
For those of you joining us for the first time, here a short description:
During these sessions we are inviting specialists presenting their hands-on experience, current project or simply ideas regarding topics related to our field of interest (the reason for this rather wide description is based on the wide scope of data science itself… ;-)). Furthermore this gives you the opportunity to ask our lecturer of the evening questions, discuss your ideas and of course enjoy a beer in company of some interesting folks!
In addition please do not hesitate to present your own projects, ideas or thoughts…we are more than gladly sharing the stage with you! Please refer to our blog-post / Discussion entry for further details.
Title: Integrating R into the big data ecosystem using sparklyR
Description: R is a powerful language for data science, but on its own it can’t cope with large amounts of big data. sparklyR bridges this gap by connecting R to the hadoop ecosystem using spark via the tidy grammar of dplyR. Agenda:
- Types of BigData
- Introduction to Hadoop & Spark (RDD)
- Integration of spark with R via sparklyR
- Downsides of spark native languages
- Streaming and R?
Duration: about 40 min
Looking forward to meeting you at the Knowledgefeed!
Stay tuned and RSVP on our Meetup: