jaxjug July Meeting - Introduction to Spark

Event Details

jaxjug July Meeting  - Introduction to Spark

Time: July 8, 2015 from 6pm to 8pm
Location: Availity
Street: 10752 Deerwood Park Blvd S, Ste 110
City/Town: Jacksonville FL 32256
Website or Map: http://maps.google.com/maps?q…
Phone: Eyalwir@ yahoo.com
Event Type: meeting
Organized By: Eyal Wir
Latest Activity: Jul 7, 2015

Export to Outlook or iCal (.ics)

Event Description

Introduction to Spark

Presented by Carol McDonald, MapR Technologies



Apache Spark is a fast and general engine for large-scale data processing. In contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory primitives provide performance up to 100 times faster for certain applications.



The Spark software stack includes a core data-proccessing engine, an interface for interactive querying, Sparkstreaming for streaming data analysis, and growing libraries for machine-learning and graph analysis. Spark is quickly establishing itself as a leading environment for doing fast, iterative in-memory and streaming analysis.



This talk will give an introduction the Spark stack, explain how Spark has lighting fast results, and how it complements Apache Hadoop.


Please RSVP!
http://www.meetup.com/Jacksonville-JAVA-User-Group-JaxJUG/events/223679551/

Comment Wall

Comment

RSVP for jaxjug July Meeting - Introduction to Spark to add comments!

Join Codetown

Might attend (1)

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

 

Enjoy the site? Support Codetown with your donation.



InfoQ Reading List

Anthropic Introduces Managed Agents to Simplify AI Agent Deployment

Anthropic introduces Managed Agents on Claude, a managed execution layer for agent-based workflows. It separates agent logic from runtime concerns like orchestration, sandboxing, state management, and credentials. The system supports long-running multi-step workflows with external tools, error recovery, and session continuity via a meta-harness architecture.

By Leela Kumili

Slack Rebuilds Notification System, Reports 5X Increase in Settings Engagement

Slack has rebuilt its notification system with a unified architecture that separates activity from delivery, improving consistency across platforms. The redesign simplifies preferences, preserves legacy settings through transformation, and resulted in a 5x increase in user engagement with notification settings along with reduced support tickets.

By Leela Kumili

GitHub Acknowledges Recent Outages, Cites Scaling Challenges and Architectural Weaknesses

GitHub has publicly addressed a series of recent availability and performance issues that disrupted services across its platform, attributing the incidents to rapid growth, architectural coupling, and limitations in handling system load.

By Craig Risi

Presentation: Dynamic Moments: Weaving LLMs into Deep Personalization at DoorDash

Sudeep Das and Pradeep Muthukrishnan explain the shift from static merchandising to dynamic, moment-aware personalization at DoorDash. They share how LLMs generate natural-language "consumer profiles" and content blueprints, while traditional deep learning handles last-mile ranking. This hybrid approach allows the platform to adapt to short-lived user intent and massive catalog abundance.

By Sudeep Das, Pradeep Muthukrishnan

Article: Redesigning Banking PDF Table Extraction: A Layered Approach with Java

PDF table extraction often looks easy until it fails in production. Real bank statements can be messy, with scanned pages, shifting layouts, merged cells, and wrapped rows that break standard Java parsers. This article shares how we redesigned the approach using stream parsing, lattice/OCR, validation, scoring, and selective ML to make extraction more reliable in real banking systems.

By Mehuli Mukherjee

© 2026   Created by Michael Levin.   Powered by

Badges  |  Report an Issue  |  Terms of Service