Skip to content

api-evangelist/apache-livy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Apache Livy (apache-livy)

Apache Livy is a service that enables easy interaction with a Spark cluster over a REST interface. It allows submitting Spark jobs or snippets of Spark code, retrieving results synchronously or asynchronously, and managing Spark contexts across multiple users.

URL: Visit APIs.json URL

Run: Capabilities Using Naftiko

Tags:

  • Big Data, Interactive Computing, Open Source, REST, Spark

Timestamps

  • Created: 2026-03-16
  • Modified: 2026-04-19

APIs

Apache Livy REST API

The Livy REST API provides endpoints for creating and managing interactive Spark sessions, submitting batch Spark jobs, executing code statements, and retrieving job results and logs.

Human URL: https://livy.apache.org/docs/latest/rest-api.html

Tags:

  • Batch Jobs, REST, Sessions, Spark

Properties

Common Properties

Features

Name Description
Interactive Spark Sessions Create persistent Spark contexts for interactive code execution in Python, Scala, R, and SQL.
Batch Job Submission Submit batch Spark jobs without creating an interactive session.
Multi-Language Support Execute code in PySpark, Spark (Scala), SparkR, and SQL.
Multi-User Impersonation Proxy user support for multi-tenant Spark cluster access.
Asynchronous Execution Submit jobs and poll for results asynchronously.
Log Access Retrieve driver and executor logs for debugging.
REST Interface Simple HTTP REST API for Spark cluster interaction without native clients.

Use Cases

Name Description
Notebook Integration Power Jupyter, Zeppelin, and other notebooks with Spark backends via Livy.
Data Engineering Pipelines Submit Spark batch jobs from orchestration tools like Airflow and Oozie.
Interactive Data Exploration Execute ad-hoc Spark code for exploratory data analysis.
Multi-Tenant Spark Access Enable multiple users to share a Spark cluster with isolation via Livy sessions.

Integrations

Name Description
Apache Spark Livy requires a Spark cluster and acts as the REST gateway to Spark.
Apache Zeppelin Zeppelin notebook backend using Livy for distributed Spark execution.
Jupyter Notebook Jupyter sparkmagic extension uses Livy for remote Spark kernel access.
Apache Airflow Airflow LivyOperator for submitting Spark batch jobs from DAGs.
Amazon EMR Livy is available as an EMR application for REST-based Spark access.

Artifacts

Machine-readable API specifications organized by format.

OpenAPI

JSON Schema

12 schema files extracted from the REST API OpenAPI specification.

JSON Structure

12 JSON Structure files converted from JSON Schema files.

Examples

12 example JSON files generated from JSON Schema definitions.

Capabilities

Naftiko capabilities organized as shared per-API definitions composed into customer-facing workflows.

Shared Per-API Definitions

Workflow Capabilities

Workflow APIs Combined Tools Persona
Apache Livy Spark Job Management Apache Livy REST API 7 Data Engineer, Data Scientist

Vocabulary

Rules

Maintainers

FN: Kin Lane

Email: info@apievangelist.com

About

Apache Livy is a service that enables easy interaction with a Spark cluster over a REST interface. It allows submitting Spark jobs or snippets of Spark code, retrieving results synchronously or asynchronously, and managing Spark contexts across multiple users. Licensed under Apache 2.0.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors