Big Data

External Article

Big Data Basics - Introduction to HDFS

  • Article

One of the core components of the Hadoop framework and responsible for the storage aspect is HDFS. Unlike the usual storage available on our computers, HDFS is a Distributed File System and parts of a single large file can be stored on different nodes across the cluster. Here are some of the key concepts related to HDFS.

2014-03-14

4,123 reads

External Article

Disaster Recovery and the Big Data Application

  • Article

Ask database administrators how they implement disaster recovery in their big data environments and you'll get two typical responses: DR plans are not necessary and backups take up a lot of space. Despite this reasoning, a disaster recovery plan for your big data implementation may be essential for your company's future.

2014-02-04

4,531 reads

External Article

The DBA's Big Data Holiday Wish List

  • Article

Big data is now a standard part of information technology architecture for most large organizations. As a database administrator, with the holiday season upon us, I have the following items and notions on my holiday wish list. Here's hoping that I am gifted one or more of these; each one gives me something that I want or need.

2013-12-31

4,492 reads

External Article

Tuning Distributed Applications to Access Big Data

  • Article

Distributed applications are just that: distributed across one or more hardware platforms across the enterprise. The database administrator (DBA) has the unenviable task of monitoring these environments and configuring and tuning the database server to meet multiple needs. As multiple distributed applications now require access to a very large data store, what tuning options are available to help?

2013-10-31

3,024 reads

External Article

Database Backup and Recovery in the Age of Big Data

  • Article

Big data is the latest craze. Hardware and software vendors have overwhelmed IT departments with high-speed analytical software, proprietary high-performance hardware, and columnar-based data stores promising quick access and lightning-fast answers to ad hoc analytical queries. Forgotten in this blast of technology are the database administrators' most important responsibilities: backup and recovery.

2013-10-01

3,853 reads

External Article

The Big Data DBA

  • Article

Many IT enterprises are starting pilot projects to implement big data solutions. This article highlights what you need to know to check if you're ready to support these efforts, and integrate them into your current architecture, processes, and standards.

2013-08-02

3,540 reads

Blogs

AI: Blog a Day – Day 7: Vector and Vector Databases

By

Continuing from Day 6 we learned Embeddings, Semantic Search and Checks, on Day 7...

AI: Blog a Day – Day 6: Embeddings – How AI Understands

By

Continuing from Day 5 where we covered notebooks, HuggingFace and fine tuning AI now...

The Book of Redgate: Mistakes

By

This is kind of a funny page to look at. The next page has...

Read the latest Blogs

Forums

Why End-User Testing Is Even More Important with AI

By dbakevlar

Comments posted to this topic are about the item Why End-User Testing Is Even...

Dynamic Unpivot

By pietlinden

I have a table I didn't design that has tons of repeating groups in...

Writing as an Art and a Job

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Writing as an Art and...

Visit the forum

Question of the Day

String Similarity II

What is the range for the result from the EDIT_DISTANCE_SIMILARITY() function in SQL Server 2025?

See possible answers