The Data Wrangler's Handbook: Simple Tools for Powerful Results

Data manipulation and analysis are far easier than you might imagine—in fact, using tools that come standard with your desktop computer, you can learn how to extract, manipulate, and analyze data (and metadata) of any size and complexity. In this handbook, data wizard Banerjee will familiarize you with easily digestible but powerful concepts that will enable you to feel confident working with data. With his expert guidance, you’ll learn how to

  • use a single-word command to sort files of any size by any criteria, identify duplicates, and perform numerous other common library tasks;
  • understand data formats, delimited text and CSV files, XML, JSON, scripting, and other key components of data;
  • undertake more sophisticated tasks such as comparing files, converting data from one format to another, reformatting values, combining data from multiple files, and communicating with APIs (Application Programming Interfaces);
  • save time and stress through simple techniques for transforming text, recognizing symbols that perform important tasks, a Regular Expression cheat sheet, a glossary, and other tools.

Library technologists and those involved in maintaining and analyzing data and metadata will find Banerjee’s resource essential.

"1134050047"
The Data Wrangler's Handbook: Simple Tools for Powerful Results

Data manipulation and analysis are far easier than you might imagine—in fact, using tools that come standard with your desktop computer, you can learn how to extract, manipulate, and analyze data (and metadata) of any size and complexity. In this handbook, data wizard Banerjee will familiarize you with easily digestible but powerful concepts that will enable you to feel confident working with data. With his expert guidance, you’ll learn how to

  • use a single-word command to sort files of any size by any criteria, identify duplicates, and perform numerous other common library tasks;
  • understand data formats, delimited text and CSV files, XML, JSON, scripting, and other key components of data;
  • undertake more sophisticated tasks such as comparing files, converting data from one format to another, reformatting values, combining data from multiple files, and communicating with APIs (Application Programming Interfaces);
  • save time and stress through simple techniques for transforming text, recognizing symbols that perform important tasks, a Regular Expression cheat sheet, a glossary, and other tools.

Library technologists and those involved in maintaining and analyzing data and metadata will find Banerjee’s resource essential.

40.99 In Stock
The Data Wrangler's Handbook: Simple Tools for Powerful Results

The Data Wrangler's Handbook: Simple Tools for Powerful Results

by Kyle Banerjee
The Data Wrangler's Handbook: Simple Tools for Powerful Results

The Data Wrangler's Handbook: Simple Tools for Powerful Results

by Kyle Banerjee

eBook

$40.99  $54.00 Save 24% Current price is $40.99, Original price is $54. You Save 24%.

Available on Compatible NOOK devices, the free NOOK App and in My Digital Library.
WANT A NOOK?  Explore Now

Related collections and offers

LEND ME® See Details

Overview

Data manipulation and analysis are far easier than you might imagine—in fact, using tools that come standard with your desktop computer, you can learn how to extract, manipulate, and analyze data (and metadata) of any size and complexity. In this handbook, data wizard Banerjee will familiarize you with easily digestible but powerful concepts that will enable you to feel confident working with data. With his expert guidance, you’ll learn how to

  • use a single-word command to sort files of any size by any criteria, identify duplicates, and perform numerous other common library tasks;
  • understand data formats, delimited text and CSV files, XML, JSON, scripting, and other key components of data;
  • undertake more sophisticated tasks such as comparing files, converting data from one format to another, reformatting values, combining data from multiple files, and communicating with APIs (Application Programming Interfaces);
  • save time and stress through simple techniques for transforming text, recognizing symbols that perform important tasks, a Regular Expression cheat sheet, a glossary, and other tools.

Library technologists and those involved in maintaining and analyzing data and metadata will find Banerjee’s resource essential.


Product Details

ISBN-13: 9780838919101
Publisher: American Library Association
Publication date: 08/05/2019
Sold by: Barnes & Noble
Format: eBook
Pages: 176
File size: 4 MB

About the Author

Kyle Banerjee has twenty years' library experience, extensive systems knowledge, and has planned and written software to support ILS, digital collections, and resource-sharing system migrations since 1996. He coauthored two other textbooks about digital libraries and has written numerous articles on library automation.

Table of Contents

Cover Page Title Page Copyright Page Contents Figure and Tables Acknowledgments Introduction Mac Windows Meet the Command Line Two Powerful Symbols Direct Output to a File (Greater than Symbol) Direct Output to Another Program (Pipe Symbol) Command Substitution Regular Expressions—The Swiss Army Knife for Data Literal Characters Wildcard Characters Grouping Scripting Chapter 3. Understanding Formats, by David Forero Chapter 4. Simplify Complicated Problems Isolating Specific Data Elements Converting Data into Formats That Are Easier to Work With Chapter 5. Delimited Text Commas and Quotation Marks in CSV Files Multiline Fields in CSV Files Multivalued Fields in Delimited Files Chapter 6. XML So What Is XML, Really? What Makes XML So Useful? Why Is XML So Easy? DOM (Document Object Model) XPath XSLT (eXtensible Stylesheet Language Transformations) Working with Large XML Files Working with Complex XML Files XmlStarlet Installing XmlStarlet Converting XML Documents Chapter 7. JSON (JavaScript Object Notation) Chapter 8. Scripting Variables Arguments Conditional Execution Loops Locating Files That Contain Particular Data Working with Internal Metadata Working with APIs Combining Data from Different Sources Other Tasks Chapter 10. Conclusions One-Line Wonders Identify All Files in Current Directories and Subdirectories That Contain a Value View Lines 4369–4374 of a File Count Records Containing an Expression Iterate through Every Item in Parameter List Use Foreign Character Sets in a Terminal Window Convert List of Names from Direct Order to Indirect Order Remove Newline Characters from a File Convert Comma Delimited File Where Some Values Are Quoted and Some Values Are Not to Tab Delimited Find the Most Common Values in the Second Field of a File Delete Elements, Attributes, or Values Based on XPath Expressions Pretty Print XML Document Glossary Symbols That Perform Important Tasks Useful Commands Regular Expression Cheat Sheet Index
From the B&N Reads Blog

Customer Reviews