15 skills found
Soostone / Hadron: Construct and run Hadoop MapReduce programs in Haskell
crowbar-archive / Barclamp Pig: [UNMAINTAINED] Hadoop Pig: MapReduce programming component
DmitryMezhensky / Hadoop And Swift Integration: API to run Hadoop MapReduce programs over Swift
blrunner / Hadoop Beginners Example: HDFS and MapReduce example source code accompanying wikibooks "Beginning Hadoop Programming" by Jaehwa Jung
jhlivingstone / WeightedGraphMax SavedPath: This Hadoop MapReduce program operates on a directed graph in adjacency list format. It computes the maximum total of node weights from top to bottom of the graph and records the path taken to reach that maximum, by performing a breadth-first graph search using an iterative MapReduce algorithm.
shask9 / Matrix Multiplication Hadoop: Hadoop MapReduce program to compute multiplication of two sparse matrices
agrawal-priyank / Restaurant Recommender System AWS Hadoop MapReduce: Academic project for Advance Database Management Systems (Big-Data) course
dataApps / Chlorine Hadoop: MapReduce program to detect and mask sensitive data in Hadoop
HxnDev / Hadoop MapReduce To Analyze Sentiment Of Keyword: In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.
Ritik2703 / Coursera Introduction To Big Data By University Of California San Diego: Here’s an overview of our goals for you in the course. After completing this course you should be able to:
- Describe the Big Data landscape, including examples of real-world big data problems and the three key sources of Big Data: people, organizations, and sensors.
- Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value) and why each impacts data collection, monitoring, storage, analysis, and reporting.
- Get value out of Big Data by using a 5-step process to structure your analysis.
- Identify what are and what are not big data problems, and be able to recast big data problems as data science questions.
- Provide an explanation of the architectural components and programming models used for scalable big data analysis.
- Summarize the features and value of core Hadoop stack components, including the YARN resource and job management system, the HDFS file system, and the MapReduce programming model.
- Install and run a program using Hadoop!
Throughout the course, we offer you various ways to engage and test your proficiency with these goals. Required quizzes give you the opportunity to retrieve core definitions, terms, and key take-away points. We know from research that the first step in gaining proficiency with something involves repeated practice to solidify long-term memory. We also offer a number of optional discussion prompts where we encourage you to think about the concepts covered as they might impact your life or business. We encourage you both to contribute to these discussions and to read and respond to the posts of others. This opportunity to consider the application of new concepts to problems in your own life really helps deepen your understanding and your ability to use the new knowledge you have learned.
Finally, we know this is an introductory course, but we offer you one problem-solving opportunity to give you practice in applying the MapReduce process. MapReduce is a core programming model for Big Data analysis, and there’s no better way to make sure you really understand it than by trying it out for yourself! We hope that you will find this course both accessible and capable of helping you deepen your thinking about the core concepts of Big Data. Remember, this is just the start of our specialization, but it’s also a great time to take a step back and think about why the challenges of Big Data now exist and how you might see them impacting your world, or the world in the future!
HawxChen / CloudComputing: MapReduce, Spark, Hadoop, PostgreSQL, Cluster Management
aumashankar / Hadoop: Hadoop HDFS, MapReduce, Pig, Hive programs
ragu8 / DSCP507 MAPREDUCE PROGRAMMING WITH HADOOP: MapReduce is a programming paradigm that enables massive scalability across hundreds or thousands of servers in a Hadoop cluster
yoni / R Streaming MapReduce Example: An example R MapReduce script taken from http://wiki.abhishek-tiwari.com/html/r-programming/r-hadoop-mapreduce.html
dboston1 / Reddit Sentiment Analysis: Program that performs textual analysis of Reddit data (approx. 300 GB) preprocessed by another team member. Uses Hadoop's MapReduce to classify comments as either positive or negative based on certain keywords, negation, etc.