External Article

Why Would I Ever Need to Partition My Big ‘Raw’ Data?

Whether you are running an RDBMS or a Big Data system, it is important to consider your data-partitioning strategy. As the volume of data grows, it becomes increasingly important to match the way you partition your data to the way it is queried, to allow 'pruning' optimisation. When you have huge imports of data to consider, it can get complicated. Bartosz explains how to get things right; not perfectly, but wisely.

External Article

How to Start Big Data with Apache Spark

It is worth getting familiar with Apache Spark because it is a fast and general engine for large-scale data processing, and you can use your existing SQL skills to get going with analysis of the type and volume of semi-structured data that would be awkward for a relational database. With an IDE such as Databricks you can very quickly get hands-on experience with an interesting technology.

Blogs

Optimizing Redshift Performance by Configuring WLM Queues

Efficient query performance in Amazon Redshift often comes down to how well you manage...

PowerShell Strikes Back: Return of the Loop

Welcome back to PowerShell Strikes Back. We’re three weeks in, and the training is...

Breaking Rules in Rome

Last week I was honored with a trip to Rome for the Redgate President’s...

Forums

Why Your Index Isn't Being Used? - Reading Execution Plans to Find the Real Culprit

By Sanket Parmar

Comments posted to this topic are about the item Why Your Index Isn't Being...

Distance Metric Algorithms

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Distance Metric Algorithms

What Can AI Really Do?

By Steve Jones - SSC Editor

Comments posted to this topic are about the item What Can AI Really Do?

Question of the Day

Distance Metric Algorithms

What are the distance metric algorithms that can be used in VECTOR_DISTANCE()?