Fair AI/ML 4 Endeavor to Bridge Gap between 2 and 2.414
We dedicated to solve the open problem left by Greedy Capture: Is there an algorithm that has better PROP guarantee ?
My Initial Immature Idea
Do some fine-tuning on results of Greedy Capture.
Pseudo code
- Greedy Capture
- // denotes “free” data points
- while 2-approximate Block Coalition
- all data points in that 2-approx Block Coalition
- all cluster centers associated with any data points in
- all data points associated with any cluster centers in
- cluster center of 2-approximate Block Coalition
- return
Idea is straightforward, directly eliminate 2-approx Block Coalition.
But how to continue, we need to rerun the Greedy Capture at some point.
Improve the Idea
Instead of using Greedy Capture as black box, we do some modification in it.
“Fine-tuning” Greedy Capture
In the while loop of Greedy Capture, we audit whether there exist 2-approx Block Coalition.
If there is one, we open the corresponding cluster center , and fix all point in the Block Coalition.
Set the fixed radius , denoting the distance from cluster center of BC to the -th nearest data point.
Open that cluster center.
Then run the Greedy Capture from the scratch neglecting all fixed point. We don’t synchronously increase until reaches .
Also auditing the 2-approximate in that Greedy Capture, if there are block coalition related to any point in cluster , we do the same recursively.
Pseudo Code
Fine-tuning Greedy Capture
Input: // denotes the fixed radius for 2-block coalition, denotes the center of 2-block coalition
while do
Smoothly and synchronously increase
If for other surpass , increase also.
while s.t.
while s.t. do
while s.t. is the center of a 2-block coalition
-th nearest distance from
return Fine-tuning Greedy Capture
return
Important: DOES IT WORK?
It is obvious that if the algorithm terminates, then we can find 2-approximate PROP clustering.
Running on the Example
On the example where no matter how the clustering is, the PROP approximation can not be better than 2. The algorithm exactly returns 2-PROP clustering.
On the example in which is the lower bound for Greedy Capture, The algorithm returns exactly PROP clustering.
Two symmetry parts, .
Consider one part that we only assign one cluster.
graph TB 1(x1) 2(x2) 3((a1)) 4((a2)) 5((a3)) 1---|1-ε|3 1---|2.414|4 1---|1-ε|5 2---|2.414|3 2---|1|4 2---|0.414|5
- First the algorithm open by Greedy Capture.
- Fine-tuning part finds that and are a 2-approximate block coalition, then fixes them, rerun the Greedy Capture from the scratch.
- Finally, are no longer opened. It is exactly PROP clustering.
Questions
There are a few question need to be answered.
❓How to prove that the algorithm always terminate?
❓Can this algorithm always find the best PROP clustering among all possible clustering?