Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia

By: Karau, Holden [author]
Contributor(s): Konwinski, Andy [author] | Wendell, Patrick [author] | Zaharia, Matei [author]
Material type: Text
Publisher: Beijing ; O'Reilly, [2015]
Edition: First edition
Description: xvi, 254 pages : illustrations
Content type: text
Media type: unmediated
Carrier type: volume
ISBN: 9781449358624; 1449358624
Subject(s): BIG DATA | DATA MINING -- COMPUTER PROGRAMS
DDC classification: 006.312
Contents:
Introduction to data analysis with Spark -- Downloading Spark and getting started -- Programming with RDDs -- Working with key/value pairs -- Loading and saving your data -- Advanced Spark programming -- Running on a cluster -- Tuning and debugging Spark -- Spark SQL -- Spark streaming -- Machine learning with MLlib.
Summary: This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning. -- Source other than Library of Congress.
Holdings
Item type: General book
Current library: Biblioteca Campus San Joaquín
Collection: General Collection
Call number: 006.312 K18
Copy number: 1
Status: Available
Barcode: 35609002082652

Subtitle on cover: Lightning-fast data analysis.

Includes index.
