Virtual Artifact

Getting Started

A collection of 3 posts
AWS

AWS CodeCommit Introduction - Part 2

In the previous article we looked at a general overview of the AWS CodeCommit service. In this article we will look at setting up and accessing a repository in CodeCommit. There are several ways to work with CodeCommit:
* AWS Management Console
* Use Git credentials with HTTPS
* Federated Access
* Temporary credentials
* Web
Jan 18, 2021 — 2 min read
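
For the Git-credentials-over-HTTPS option in the excerpt above, the clone address follows a fixed pattern built from the AWS region and the repository name. The sketch below only assembles that string; the helper name `codecommit_https_url` and the repository name `demo-repo` are hypothetical, chosen for illustration.

```python
def codecommit_https_url(region: str, repo_name: str) -> str:
    """Build the HTTPS clone URL for a CodeCommit repository.

    Assumes the standard regional endpoint pattern:
    https://git-codecommit.<region>.amazonaws.com/v1/repos/<repo>
    """
    if not region or not repo_name:
        raise ValueError("region and repo_name must be non-empty")
    return f"https://git-codecommit.{region}.amazonaws.com/v1/repos/{repo_name}"


# "demo-repo" is a made-up repository name used only for this example.
url = codecommit_https_url("us-east-1", "demo-repo")
print(f"git clone {url}")
```

Git prompts for the user name and password of the Git credentials generated in IAM when the printed command is run.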
Getting Started

PySpark - Spark SQL Context

Spark aims to make it easy to work with data. One way it achieves this is by letting you work with Spark data as if you were working with a SQL database:
* Spark SQL enables querying of DataFrames as database tables
* Temporary per-session and global tables
* The Catalyst optimizer makes SQL queries
Apr 20, 2020 — 6 min read
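
As a rough illustration of querying a DataFrame as a table, the sketch below registers a small DataFrame as a temporary per-session view and runs SQL against it. The function name `adults_via_sql`, the view name `people`, and the sample rows are assumptions made for this example; the function expects an existing `SparkSession` to be passed in.

```python
def adults_via_sql(spark):
    """Register a DataFrame as a temp view and query it with SQL.

    `spark` is assumed to be an existing pyspark.sql.SparkSession.
    """
    # Sample rows invented for this sketch.
    people = spark.createDataFrame(
        [("Ana", 34), ("Ben", 17), ("Cy", 52)],
        ["name", "age"],
    )
    # The view lives only for the lifetime of this Spark session.
    people.createOrReplaceTempView("people")
    # spark.sql returns a new DataFrame; nothing runs until an action is called.
    return spark.sql("SELECT name FROM people WHERE age >= 18 ORDER BY name")


# Possible usage, assuming PySpark and Java are installed locally:
#   from pyspark.sql import SparkSession
#   spark = SparkSession.builder.master("local[1]").appName("sql-sketch").getOrCreate()
#   adults_via_sql(spark).show()
#   spark.stop()
```

A view that should be visible across sessions can be registered with `createOrReplaceGlobalTempView` instead, in which case it is queried as `global_temp.people`.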
Getting Started

Spark 2 setup

This demo will cover:
* Install standalone Spark on your local machine
* Set up the PySpark REPL interface

Requirements:
* This demo will be done with Python 3
* Java v8
* Jupyter notebooks

Download Spark 2 from https://spark.apache.org/downloads.html
1. Choose the most recent 2.x build
2. Choose package:
Apr 13, 2020 — 1 min read
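
Before downloading Spark, it can help to confirm that the listed requirements are present. The sketch below is one possible pre-flight check using only the Python standard library; the function name `check_prereqs` is hypothetical, and the check only looks for `java` and `jupyter` on the `PATH` without verifying that the Java version is 8.

```python
import shutil
import sys


def check_prereqs() -> dict:
    """Report which of the demo's requirements appear to be available."""
    return {
        # The demo is written for Python 3.
        "python3": sys.version_info.major == 3,
        # Only checks that a `java` executable exists, not its version.
        "java_on_path": shutil.which("java") is not None,
        # Jupyter installs a `jupyter` launcher when available.
        "jupyter_on_path": shutil.which("jupyter") is not None,
    }


for name, ok in check_prereqs().items():
    print(f"{name}: {'found' if ok else 'missing'}")
```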
Virtual Artifact © 2025