jaxjug July Meeting - Introduction to Spark

Event Details

Time: July 8, 2015 from 6pm to 8pm
Location: Availity
Street: 10752 Deerwood Park Blvd S, Ste 110
City/Town: Jacksonville FL 32256
Website or Map: http://maps.google.com/maps?q…
Phone: Eyalwir@ yahoo.com
Event Type: meeting
Organized By: Eyal Wir
Latest Activity: Jul 7, 2015

Event Description

Introduction to Spark

Presented by Carol McDonald, MapR Technologies



Apache Spark is a fast and general engine for large-scale data processing. In contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory primitives provide performance up to 100 times faster for certain applications.



The Spark software stack includes a core data-processing engine, an interface for interactive querying, Spark Streaming for streaming data analysis, and growing libraries for machine learning and graph analysis. Spark is quickly establishing itself as a leading environment for fast, iterative in-memory and streaming analysis.



This talk will introduce the Spark stack, explain how Spark achieves lightning-fast results, and show how it complements Apache Hadoop.


Please RSVP!
http://www.meetup.com/Jacksonville-JAVA-User-Group-JaxJUG/events/223679551/
