About these posts on TDA
Topological Data Analysis (TDA) is exactly what it sounds like, using tools from topology to study data. I plan on writing a series of posts that takes us from the basics of topology to the current state of affairs in TDA, this is the first in that series.What is Topology
From a million miles away topology is the study of shapes, you may also remember that geometry is the study of shapes. So what is the difference between geometry and topology? Well, geometry cares about every little detail of a shape like, does it have corners, what is the curvature at a point, what are the distances between two points, angles, ect. Topology however only cares about the global properties of a shape, what is the basic shape of the object even if we smush it around a little bit? Below is an example, you can see that geometry cares that a square and a circle are different (one has corners ect) however topology only cares that both basically form a loop.To make this more formal we can say that topology is the study of the properties of shapes that are preserved under continuous deformations. So two shapes are considered the same if we can stretch or bend one to look like the other. It is exactly this flexibility that makes topology such a useful tool.
Motivation for TDA
Topology is a mature mathematical subject with many tools and techniques. The basic idea behind TDA is to use these techniques to learn something about a data set. Data sets may be very high dimensional making them impossible to visualize and hard to find qualitative information about. This is where topology comes in, the dimension of the data is in many ways irrelevant and the tools of topology give new types of "statics" for data sets.Basic outline for use of TDA
- Start with a data set
- Build a topological space (shape) out of the data
- Compute the (persistent) homology of the space (homology is a computable topological invariant of the space, we'll have alot more to say about homology later)
- Use the information from 3 to further investigate your data
No comments :
Post a Comment