Problem To Solve
It’s not always possible to tell what data a model was trained on, even if you have access to the model weights, because training models is a compression of training data. This introduces several challenges that do not exist in traditional software: • Complicates copyright • Harder to fairly compensate the owners of the data • Harder to know who trained each part of the model in settings where training is done by more than one party, e.g. Model Zoo • Easier to add biases in models Source: Mohamed Baioumy & Alex Cheema (AI x Crypto Primer)
Problem Solution
Make the training process itself verifiable. Build tools to break down how a model was trained and check if it contains a given piece of data. Several approaches can be explored: Integrate cryptographic primitives into the training process itself. For example, Pytorch NFT Callback hashes the current network weights, some metadata (data, accuracy, etc…) and your eth address every N epoch, which proves who did the model training. Note: This approach introduces a performance overhead to training models.
Inspiration
Source: Mohamed Baioumy & Alex Cheema (AI x Crypto Primer) Full credits go toward these two legends.
Verifiable Training
![avatar](https://hackquest-s3-prod-apne1.s3.ap-northeast-1.amazonaws.com/users/c15691bc-7980-4c2c-872d-19e475842e41/avatar_b808a1ac-c488-4fd7-b762-748d4ff89b31.jpeg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Credential=ASIAYCTGVDAPIAOORLIA%2F20240707%2Fap-northeast-1%2Fs3%2Faws4_request&X-Amz-Date=20240707T041318Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjECsaDmFwLW5vcnRoZWFzdC0xIkgwRgIhAJsgA%2FGg%2FbTPt2zw77xYGfyPdLivI2YKM9uVZJj9Z7YlAiEA9UvBF%2FRdvNxb4p7hEdjQbIRXK4b6hO9QctDsAnL9nyAqpwQI5f%2F%2F%2F%2F%2F%2F%2F%2F%2F%2FARABGgw1NTUzMzk4MTQ5NDIiDBCNLHYV7dxsrtSsgir7A91nprWTntzKqrzUwGnuY7hYf5dZ%2BQ8QLnhJSwlZOTi46HWPPQTSkupasP9%2BhCilbZW72nOsjk96IoaP3qPmEf4%2F44kbRhAEK0e5neYEwsFHJl1PrQ8r%2FyyBlE6ru3hH1YXYc0K%2Ffd9ENRlutDTtTkENHhYqYei4BySlaOyFFGjDTiFa6i5Ar6YF%2BqKB8WoAmhIhoVh%2FVLF%2BLkRA2NG0n6sNgx0Y%2ByQv2rBUcILlWhqRlLlTG47fyQP25NQySjyeQ4M11ckAeCPUHZA7N%2FLGOJVTHel6i%2B4xfAOPbM0fHe7zHSdM2oofhha6qloUxDoT8dOvkU76njKnI0br92kD1KncLZvZy5P%2FyRvoBHguHNizKP7X1o00QGnvUuS%2Bic0zO%2FTxtOfMAm4SPAleQmtnvpYLRT1vlqGu5OyuU5pUv7HT9TbYmTwTaUpyQVipm18D67eKEySEh89LBfY6LHKAbITC5OspN4v7oHQ3UyHTzBbqChHz8IsG38dTVmMx%2BJn%2BX43YJ9FePFjDljbUyG2%2F56jZLUO%2BliOSRZumCce8m36%2B%2B82lq8vB37ZTzpPclKTA1RutPbp5neodURo4YH0biGK7%2BavT3RYxugyVUhcFZWCEXm1schEG02bsYd4Fnayrbtrq8GlU63N6OyJ2g8qgf8ZGNl44y9UkAWVPWDC%2BmKi0BjqlATnw%2B%2F2ToNE4vKn1oku0FsCzbS2NkU%2FIvcVdZYKiUQR8xApS5IcocoVdrlwQKpNsh%2B192HDaxHEzMxWBqcdgRrejdLkjUy6fWPl7Q%2BluFehDT6EOFgBBFiaF2qID81NZ%2FG%2FT5b%2Be0KG%2Bah8ePP9Wtx%2Bt5J4bTigDj3bw0uQMFOl2Ob8CogC2vEpFueau1SDO6CY1oX71xxF%2Fjgyc%2FYZaRjQXfm%2FZ9w%3D%3D&X-Amz-Signature=c3debcb1a2ad5dc463e98dd1c58e9b354283e54c20e75c7ae59f1e2b52afce2e&X-Amz-SignedHeaders=host&x-id=GetObject)